Automated Gene-Targeting Using Non-Toxic Detectable Markers

The invention relates to a method that enables automated identification and
isolation of cells harbouring a predetermined genetic modification (homologous
recombination) using detectable/sortable markers, e.g. fluorescence markers, to
identify homologous DNA modifications. Suitable vectors are also provided.

Background 

Gene targeting in cells, including embryonic stem (ES) cells, relies on the
homologous recombination (HR) between a native chromosomal gene and
introduced exogenous DNA (Smithies, O. et al., Nature, 317(6034):230-4 (1985);
Thomas, K. R. et al., Cell, 44(3):419-28 (1986); Doetschman, T. et al., Proc. Natl.
Acad. Sci. USA 85(22):8583-7 (1988); Thomas, K. R., Capecchi, M. R., Cell,
51(3):503-12 (1987)). The method requires transfection of the foreign DNA
(targeting vector) into the ES cells, usually by electroporation. HR is a very
inefficient process, working in the range of 1 x 10⁻⁵ to 1 x 10⁻⁶ events per
electroporated ES cell (for review see Hooper, M.C., Harwood, Embryonal Stem
Cell: Introducing Planned Changes into the Animal Germline, New York (1993)).
Most genes of interest are not directly selectable. The identification of homologous
recombined (gene targeted) ES cells therefore requires the application of selection
mechanisms.
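As a rough illustration of the scale implied by the stated frequency range, the expected number of HR events per electroporation can be estimated as follows. This is a minimal sketch: the function name and the batch size of 1 x 10⁷ electroporated cells are assumptions made for illustration and are not taken from this application.

```python
def expected_hr_events(cells_electroporated: int, hr_frequency: float) -> float:
    """Expected number of HR events = cell number x per-cell HR frequency."""
    if cells_electroporated < 0 or not 0.0 <= hr_frequency <= 1.0:
        raise ValueError("cell number must be >= 0 and frequency within [0, 1]")
    return cells_electroporated * hr_frequency


# Frequency range stated in the text (events per electroporated ES cell).
HR_FREQ_HIGH = 1e-5
HR_FREQ_LOW = 1e-6

# Hypothetical batch size, chosen only for illustration.
cells = 10_000_000
low = expected_hr_events(cells, HR_FREQ_LOW)
high = expected_hr_events(cells, HR_FREQ_HIGH)
print(f"about {low:.0f} to {high:.0f} HR events expected among {cells:,} cells")
```

Because so few events are expected among millions of cells, some form of selection or detection is needed to find them.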

Positive selection is usually achieved by incorporation of an antibiotic resistance 
gene into the targeting vector, located between at least 2 stretches of DNA 
homologous to the targeting region. Upon addition of antibiotic to the culture 
medium of electroporated ES cells, expression of the positive selection marker 
allows survival only of those cells which have stably introduced the electroporated 
DNA into their genome. 

Additionally, negative selection strategies further enrich selective growth and
identification of HR events. A negative selection marker is added at the 5' or 3' end
of a homologous DNA stretch (Mansour, S. L. et al., Nature, 336:348-352 (1988);
U.S. Patents Nos. 5,487,992 and 5,464,764). Non-homologous DNA regions outside
the homologous regions are not incorporated into the genome if HR has occurred.
Current protocols (U.S. Patents 5,487,992 and 5,464,765) require the addition of a
compound to the culture medium to mediate selection against incorporation of the
marker, effectively killing ES cells which have undergone random recombination.
Negative selection generally achieves enrichments of only 2- to 5-fold (Sedivy, J. M.,
Dutriaux, A., Trends Genet., 15(3):88-90 (1999)).

Given optimal vector design to allow homologous recombination, the described 
selection strategies known in the art (e.g. the positive-negative selection strategies, 
disclosed in U.S. Patents 5,487,992 and 5,464,765) allow identification of HR in 
about 5% of all analysed ES cells. Said selection strategies are certainly not optimal 
and it is therefore desirable to 

(a) optimize enrichment of homologous recombinants, 

(b) reduce variability in efficiency of the currently available methods for
enrichment, and

(c) avoid unwanted potential toxicity of the agents used for negative, or negative
and positive, selection. Gancyclovir, the most widely used agent for
negative selection, inhibits DNA polymerase and therefore cellular
proliferation (Matthews, T. and Boehme, R., Reviews of Infectious Diseases,
10(3):490-4 (1998)). Such inhibition may not be tolerable under GMP
standards, in particular in potential applications of gene targeting for gene
therapy in humans.

Moreover, the selection procedures known in the art require ES cell culture over a
period of 7-8 days, to allow selection to work, and clonal expansion of cells to an
absolute number of ca. 2000, to enable molecular analysis and further expansion of
the cell population. However, culture of ES cells requires adherent growth of the
cells on a defined substrate, usually mitotically inactivated "feeder cells". This, in turn,
necessitates laborious, manual "picking" of identified ES clones from their substrate
to enable individual clonal growth in culture vessels for maintenance and molecular
analysis. It was therefore also desirable to reduce the time period required to
isolate potential HR ES clones as well as the manual labour, i.e. picking of ES cell
clones, e.g. by increasing the automation of the gene targeting process employing
current protocols.

Fluorescent probes, on the other hand, are known as a powerful tool for identification
of molecular events in single cells. The most widely known is the green fluorescent
protein (GFP) from the bioluminescent jellyfish Aequorea victoria. However,
approximately 30 distinct fluorescent proteins have been discovered and cloned
from a variety of species (Labas, Y. A. et al., PNAS, 99(7):4256-4261 (2002);
Matz, M. V. et al., Nat. Biotechnol., 17(10):906-18 (1999); for review see
Zhang, J. et al., Nat. Rev. Mol. Cell. Biol., 3(12):906-18 (2002)).

For certain reasons, such as the sensitivity of the cells to be transfected, optically
detectable markers including fluorescent probes such as GFP and its variants were
only seldom employed as detectable markers in gene targeting experiments. E.g.
WO 02/06630 discloses that GFP might be utilized instead of a neomycin
phosphotransferase gene in a gene targeting process similar to the basic positive
selection protocol. Furthermore, WO 03/002725 discloses that fluorescent gene
cassettes can be used as an alternative to negative selection markers in order to
further enrich ES cell clones which underwent homologous recombination.

A key property of murine ES cells is that they can be maintained indefinitely in vitro
if cultured in the presence of the cytokine leukemia inhibitory factor (LIF) (Smith, A.
G. and Hooper, M., Dev. Biol. 121:1-9 (1987); Williams, R. L. et al. (1988)). ES
cells retain the capacity to participate normally in embryogenesis and contribute to
all tissues of the mouse embryo when introduced into host blastocysts, including
the germ line (reviewed by Robertson, E.J., Oxford, UK, IRL Press: 71-112 (1987);
Smith, A.G., Sem. Cell. Biol. 3:385-399 (1992)). However, retention of germ-line
transmission competence is often elusive. It depends absolutely on adherence to a
rigorous tissue culture regime, with avoidance of any untoward selective pressures.
Furthermore, ES cells are only modestly able to amplify if plated as single cells to
form clonal populations. The plating efficiency is reported to range from 3 to 5%
(Reid, L. H. et al., Mol. Cell. Biol. 11:2767-2777 (1991); Templeton, N. S. et al.,
Gene Therapy 4(7):700-9 (1997)).
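To make the consequence of a 3 to 5% plating efficiency concrete, the following sketch estimates how many single cells would have to be plated to expect a given number of clones. The function name and the target of 10 clones are illustrative assumptions; only the 3 to 5% range comes from the text.

```python
import math


def single_cells_to_plate(target_clones: int, plating_efficiency: float) -> int:
    """Smallest number of single-cell wells whose expected clone yield
    (wells x plating efficiency) reaches target_clones."""
    if target_clones < 0:
        raise ValueError("target_clones must be non-negative")
    if not 0.0 < plating_efficiency <= 1.0:
        raise ValueError("plating_efficiency must be within (0, 1]")
    # The small epsilon guards against floating-point round-up at exact ratios.
    return math.ceil(target_clones / plating_efficiency - 1e-9)


# Reported plating efficiency range for single ES cells (from the text).
for efficiency in (0.03, 0.05):
    wells = single_cells_to_plate(10, efficiency)
    print(f"{efficiency:.0%} efficiency: plate {wells} single cells for ~10 clones")
```

The estimate is an expectation only; the actual yield of any one experiment will scatter around it.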

Altogether, low plating efficiency, differentiation accompanied by loss of germ-line
competence and subsequent failure to produce genetically modified mice are still the
key obstacles for the employment of automated sorting methodology for ES cells in
gene targeting studies.

Methods such as Fluorescence Activated Cell Sorting have been described for
sorting of human (Schuldiner, M. J. et al., Stem Cells 21(3):257-265 (2003)) and
murine (Reddy, S. et al., Proc. Natl. Acad. Sci. USA 89:6721-6725 (1992); WO
03/002272) ES cells; however, such cells were only bulk sorted as pools of marker
expressing cells for subsequent analysis.

High frequency gene targeting and isolation of single cells with defined
characteristics in an automated fashion as described in this application is of great
advantage to improve effectiveness and precision of gene targeting experiments.

Summary of the Invention 

It was now found that certain vectors for targeted homologous recombination which 
contain one or more expression cassettes coding for a detectable marker and being 
placed outside the targeting cassette (i.e. the region of homology of the vector to 
the genomic DNA) allow rapid and reliable distinction, preferably visual distinction, 
between targeted and non-targeted ES cells in an automated fashion. One example 
of such an optically detectable marker is ZsGreen (Clontech, Palo Alto, CA). A 
particular example is the application of a first fluorescence marker gene (e.g. 
ZsGreen) inside the region of homology (i.e. inside the targeting cassette) and a
second fluorescence marker gene differing from the first marker gene outside the
region of homology. Moreover, it was found that with a gene targeting vector
containing a positive selection marker (e.g. neomycin) inside the targeting cassette 
and a fluorescent marker outside the region of homology (e.g. ZsGreen) 
identification of both (stably transfected) non-targeted and correctly targeted ES 
cells via optical detection was possible. 



Furthermore, ES cells stably transfected with a gene targeting vector have been
automatically analyzed by fluorescence (Cytocon™300; Evotec Technologies) and
plated as single cells (Cytocon™ Single Cell Fraction Collector; Evotec Technologies).
Both high resolution analysis and single cell plating can be combined in Image
Activated Cell Sorting (Elektra, Evotec Technologies). Amplification of clonally
plated ES cells enriched for gene targeting events allowed for the first time the
isolation of gene targeting events without need for any manual picking or
sub-cloning of cells. Furthermore, the contact-free procedure and gentle cell handling
kept targeted ES cells germline-competent and allowed the production of chimeric
and, by tetraploid complementation (Nagy, A. et al., Proc. Natl. Acad. Sci. USA
90:8424-8428 (1993); Eggan, K. et al., Proc. Natl. Acad. Sci. USA 98(11):6209-14
(2001)), fully ES cell derived animals.

Altogether, this method allows for the first time automated gene-targeting from
transfection of the gene targeting vector to establishment of totipotent targeted ES
clones.

The invention thus provides

(1) a method for isolation of primary mammalian cells after homologous recombination
comprising

(a) transfecting starting primary cells with a homologous targeting vector
comprising

(A) a homologous targeting cassette which comprises (i) a functional DNA
segment and a positive selection marker or a functional DNA segment and a first
detectable marker, and (ii) two DNA segments homologous to the integration
site within the genome of the primary mammalian cells flanking (i); and

(B) an expression cassette harboring a DNA sequence coding for a second
detectable marker different from said first detectable marker, said expression
cassette (B) being connected with said homologous targeting cassette (A) so as
to allow distinction between targeted and non-targeted cells;

(b) manual or automatic identification and/or isolation of cells containing the
positive selection marker or the first detectable marker; and

(c) manual or automatic identification and/or isolation of cells as homologous
recombinants by the absence of the second detectable marker;

(2) a method for preparing transgenic tissues, organs and/or multi-cell organisms
which comprises utilizing the method as defined in (1) above; and

(3) a vector for targeted homologous recombination of eukaryotic cells as defined in
(1) above.
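
The decision logic of steps (b) and (c) of method (1) can be sketched as a small classifier. This is only an illustration of the marker readout described above; the function and label names are hypothetical, and real fluorescence measurements would first have to be thresholded into present/absent calls.

```python
from enum import Enum


class CellClass(Enum):
    TARGETED_CANDIDATE = "candidate homologous recombinant"
    NON_TARGETED = "non-targeted (random) integrant"
    NO_MARKER = "no marker detected"


def classify_cell(in_cassette_marker: bool, second_marker: bool) -> CellClass:
    """Classify one cell from two present/absent marker calls.

    in_cassette_marker: positive selection marker or first detectable marker,
        located inside the homologous targeting cassette (A).
    second_marker: second detectable marker encoded by expression cassette (B),
        located outside the region of homology.
    """
    if second_marker:
        # Cassette (B) was retained, as expected after random integration.
        return CellClass.NON_TARGETED
    if in_cassette_marker:
        # Step (b) marker present, step (c) marker absent.
        return CellClass.TARGETED_CANDIDATE
    return CellClass.NO_MARKER


for calls in [(True, False), (True, True), (False, False)]:
    print(calls, "->", classify_cell(*calls).value)
```

Cells classified as candidates would still need molecular confirmation, e.g. by Southern blot analysis, since absence of the second marker only enriches for correctly targeted cells.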

The vector utilized in the homologous recombination process of (1) above allows
direct or indirect distinction between correctly targeted and non-targeted cells.
Indirect distinction can be achieved via addition of non-toxic compounds to the cells
or via metabolic or enzymatic action of the gene product (detectable marker) 
encoded by the expression cassette. 

By utilizing one or more fluorescence markers as detectable markers, the method
(1) of the invention

a) allows optical distinction between random DNA integrants (light emission) and HR
eukaryotic cells, including ES cells (no light emission) (Fig. 2);

b) enhances isolation and reduces variability of recovery of HR ES cells in
individual experiments since the addition of compounds to the culture
medium is avoided;

c) allows selective, automated distinction of growing transfected ES cell colonies
by plating of ES cells, transformed with such light emission targeting vectors,
in a way such that preferentially single cell clones grow within a spatially
separated region (1 well of a multiwell dish) and subsequent fluorescence
analysis (such ES clones can then automatically be expanded and taken to
molecular analysis (Fig. 2));

d) allows selective, automated sorting of growing transfected ES cells for the
absence of the first detectable marker and/or the presence of the second
detectable marker and/or by Image Activated Cell Sorting (i.e. fluorescence).



The method of the invention has the following advantages over current technology: 

(i) The detection strategy allows selective identification of gene targeted
ES clones without exposure of ES cells to potentially toxic chemicals,
proteins or mutagens usually used for negative selection;

(ii) automated optical selection of clones enriched for homologous
recombination events by applying gentle sorting equipment
such as, but not limited to, Cytocon™300 & Cytocon™ Single Cell
Fraction Collector (Evotec Technologies) or Elektra (Evotec
Technologies);

(iii) delivery of isolated single cells in 1 well of a multiwell dish which retain
the capability of clonal growth;

(iv) identification of a higher percentage of clones showing homologous
recombination compared to standard procedures (i.e. positive-negative
selection, positive-positive selection) due to specific promoter and
selectable marker configurations (WO 03/002725);

(v) automated expansion, isolation and molecular analysis of gene
targeted ES clones with proven characteristics of totipotency as
demonstrated by the generation of chimeric and tetraploid mice;

(vi) significant reduction of both manual labour and associated costs for
gene targeting experiments.

Short Description of the Figures 

Fig. 1: Selection mechanism in homologous recombination: A: targeting vector; B:
homologous genomic DNA; C: homologous recombined DNA, presence of first
marker, absence of second marker.

Fig. 2: Scheme of manual and automated gene targeting process employing novel
detectable markers (solid arrows indicate experiments described in this application).

Fig. 3: Generation of targeted ES cells. Scheme of the gene targeting strategy. Both
constructs, (i) the targeting vector for positive and negative selection (pOMP1)
(3A), and (ii) the targeting vector for positive and optical detection (pOMP3) (3B)
were used for insertion of the neo gene into the Rosa26 locus by homologous
recombination as depicted. E: EcoRV; X: XbaI.

(3C) Southern blot analysis of genomic DNA from ES cells transfected with
constructs described in (Fig. 4A) and (Fig. 4B). The DNA of ES cells was digested
with EcoRV and hybridized with probe 1 specific for the rosa26 locus. The sizes of
the wt and the targeted allele are 11.5 kb and 2.5 kb, respectively. The picture
shows the analysis of identified non-fluorescent clones.

(3D) Southern blot analysis of genomic DNA from ES cells transfected with pOMP3
and automatically sorted and plated using Cytocon™ 300.

Fig. 4: Growth of targeted fluorescent and non-fluorescent (marked with arrow) ES
clones.

Fig. 5: Enrichment of gene targeting frequency in non-fluorescent ES clones.
Fig. 6 shows schematically the functional segments of the rosa26 locus.
Fig. 7 shows the vector pOMP3.
Fig. 8 shows the vector pOMP1.
Fig. 9 shows schematically the modified rosa26 locus.

Fig. 10 shows schematically the CAGGS promoter followed by an intron.

Detailed Description of the Invention 

"Targeted vector" or "vector for targeted homologous recombination", according to 
the present invention is a DNA construct that contains sequences "homologous" to 
endogenous chromosomal nucleic acid sequences which flank a desired genetic 
modification. The flanking homologous sequences direct the targeting vector to the 
specific chromosomal location within a genome by virtue of the homology that 
exists between the flanking homologous sequences and the corresponding 
endogenous sequences and introduce the desired genetic modification (i.e. a 
functional DNA segment) by a process referred to as "homologous recombination". 

"Homologous" means two or more nucleic acid sequences that are either identical 
or similar enough that they are able to hybridize to each other or undergo 
intermolecular exchange.

A "flanking DNA" is a segment of DNA that is collinear with and adjacent to a 
particular point of reference. 

"Gene targeting" is the modification of an endogenous chromosomal locus by the 
insertion into, deletion of, or replacement of the endogenous sequence via
homologous recombination using a targeting vector.

The "homologous targeting cassette" is that part of the targeted vector that carries 
the desired genetic modification, the flanking homologous sequences and optionally 
functional sequences such as selectable or detectable markers. 

The "expression cassette" is that part of the targeted vector that enables distinction
of non-targeted and correctly targeted stably transfected cells by determining its 
presence or absence in the transfected cellular system. The two cassettes are 
"connected" through covalent linkage of an arbitrary number of nucleotides. 



WO 2004/070040 PCT/EP2004/000968 

A "detectable marker" only allows for the detection and subsequent isolation of a
transfected cell, whereas a "selectable marker" or "selection marker" provides for a
selection advantage of the cell containing said marker over cells devoid of said
marker and therewith allows for the enrichment of rare transfected cells expressing
the marker from the majority of treated cells in the population.

"Multi-cell organisms" according to the present invention refers to non-human 
mammals. The "transgenic tissues" or "transgenic organs" are derived from the
multi-cell organisms referred to above.

A "non-human organism", or "non-human mammal" is an organism or mammal that 
is not normally accepted by the public as being human. 

As set forth above, the vector used for targeting the primary mammalian cells in 
the method of (1) above has the following properties: The homologous targeting 
cassette (A) comprises (i) a functional DNA segment and a positive selection 
marker or a functional DNA segment and a first detectable marker, and (ii) two DNA 
segments homologous to the integration site within the genome of the primary
mammalian cells flanking said functional DNA segment and said positive selection
marker or said functional DNA segment and said first detectable marker. The
expression cassette (B) comprises a DNA sequence coding for a second detectable
marker different from said first detectable marker. 

It is preferred that the expression cassette (B) comprises a gene coding for said 
first detectable marker under the control of a promoter (e.g. constitutive, inducible, 
etc.) active in primary mammalian cells. Preferably the promoter is a constitutive 
promoter including, but not limited to, CAGGS (Niwa, H.K. et al., Gene 108(2):193-9
(1991)), CMV (Bi, J.X. et al., Biotechnol. Bioeng. 81(7):848-54 (2003)), c-fos (Bi,
J.X. et al., Biotechnol. Bioeng. 81(7):848-54 (2003)), PGK (Adra, C.N. et al., Gene
60:65-74 (1987)), SV40 (Southern, P.J., Berg, P., J. Mol. Appl. Genet. 1:327-341
(1982)), elongation factor 1alpha (EF1alpha), RNA polymerase II (Soriano, P. et al.,
J. Virol. 65(5):2314-9 (1991)) and TK.

Moreover, it is preferred that said first and/or second detectable marker are a
non-toxic, directly or indirectly detectable compound or the marker(s) are gene(s)
expressing membrane bound protein(s) or protein(s) with a membrane anchoring
signal sequence, requiring that such proteins are not expressed in the transfected
cell of interest. Such membrane bound proteins can be identified by binding to
fluorescence or otherwise labeled detectable antibodies.

Preferably the detectable marker is a compound which is detectable by colorimetric
(Lowry method, Bradford reaction (Coomassie blue), sulforhodamine B,
β-galactosidase (lacZ), placental alkaline phosphatase (PAP)), fluorescence,
bioluminescence such as luciferases from phengodid beetles, which can produce
green to red bioluminescence (Viviani, V.R., Cell Mol. Life Sci. 59(11):1833-50
(2002)), phosphorescence detection methods or the like (for review of probes see
Haugland, R.P., Molecular Probes. Handbook of fluorescent probes and research
chemicals, 172-180, 221-229 (1992-1994); Freshney, R.I., Culture of animal cells,
3rd ed., Wiley & Sons, Inc. (1994)). Most preferred is the use of a fluorescence
protein/peptide.

In another preferred embodiment, the first and/or second detectable marker,
preferably the second detectable marker, is a membrane bound protein selected
from CD1a, CD2, CD3, CD4, CD5, CD6, CD8, CD11, CD14, CD15, CD16, CD19,
CD20, CD22, CD25, CD27, CD30, CD33, CD34, CD43, CD45, CD56, CD61, CD62,
CD69, CD71, CD90, CD105, CD117, CD123, CD133, CD138, BDCA-2, BDCA-3,
BDCA-4, CRTH2, ErbB-2, Ep-CAM (Human Epithelial Antigen, HEA), etc. Particularly
preferred is that the positive selection marker is neomycin and the second
selectable marker is selected from ZsGreen, CD4 and CD8, or the first and second
selectable markers are ZsGreen and HcRed.

According to the invention the expression cassette may be positioned 5' or 3' of a 
homologous targeting cassette. If the homologous targeting cassette harbors at 
least two regions of target gene homology which differ in length, the position of the 
expression cassette is preferably connected with the longer region of homology. 
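
The placement preference above can be expressed as a simple rule. The following is a minimal sketch; the function name, the tie-breaking behaviour for arms of equal length and the example arm lengths are assumptions for illustration and are not specified in this application.

```python
def expression_cassette_side(arm_5prime_kb: float, arm_3prime_kb: float) -> str:
    """Return the side ("5'" or "3'") on which to connect the expression cassette.

    Follows the stated preference: connect the expression cassette with the
    longer region of homology when the two arms differ in length.
    """
    if arm_5prime_kb <= 0 or arm_3prime_kb <= 0:
        raise ValueError("homology arm lengths must be positive")
    if arm_5prime_kb > arm_3prime_kb:
        return "5'"
    if arm_3prime_kb > arm_5prime_kb:
        return "3'"
    # Equal arms: the text states no preference; 5' is an arbitrary default.
    return "5'"


# Hypothetical arm lengths in kb, for illustration only.
print(expression_cassette_side(1.0, 4.5))
```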

According to the invention the primary mammalian cells include pluripotent and
totipotent cells and preferably are embryonic stem cells (hereinafter shortly referred
to as "ES cells") (excluding human ES cells if this is not patentable). Preferably the
primary mammalian cells are derived from a rodent, such as mouse, rat, etc.
Particularly preferred are mouse ES cells.

In a more preferred embodiment of the vector of (1) above, the functional DNA
segment of the targeting cassette is a DNA sequence encoding a gene of interest,
or a part of a gene of interest, or is a functional DNA sequence which can be
converted into such DNA sequence encoding a gene of interest, or intronic
sequence, or is a regulatory functional DNA sequence, including, but not limited to,
splice donor or splice acceptor sequences, and recombinase recognition
site(s) (RRS(s)).

The functional DNA segment may further comprise gene expression control 
elements including, but not limited to, ubiquitous or tissue specific promoter, either 
constitutive or inducible, a polyadenylation signal, intron sequences, recombinase
recognition site(s), enhancer recognition site(s), and matrix attachment region(s) 
(MAR). 

The positive selection marker according to the present invention may be a DNA
sequence encoding a protein conferring resistance against cell poison, including, but
not limited to, neomycin (Beck, E. et al., Gene 19(3):327-36 (1982)),
hygromycin (Bernard, H. U. et al., Exp. Cell Res. 158(1):237-43 (1985); Gordano,
T. J., McAllister, W. T., Gene 88(2):285-8 (1990); Santerre, R. F. et al., Gene
30(1-3):147-56 (1984)), puromycin N-acetyltransferase (puro) (de la Luna et al.,
Gene 62(1):121-6 (1988); de la Luna, S., Ortin, J., Methods Enzymol. 216:376-85
(1992)), E. coli xanthine guanosine phosphoribosyltransferase (gpt) (Spring, K. J.
et al., Biochim. Biophys. Acta 1218(2):158-62 (1994)), blasticidin (Kobayashi, K. et
al., Agric. Biol. Chem. 55(12):3155-7 (1991); Kimura, M. et al., Mol. Gen. Genet.
242(2):121-9 (1994)), hypoxanthine-guanosine phosphoribosyltransferase (Hprt)
(Albertini, R. J. et al., Nature 316(6026):369-71 (1985); Lester, S. C. et al.,
Somatic Cell. Genet. 6(2):241-59 (1980)), herpes simplex virus type 1 thymidine
kinase (hsvTK) (Borrelli, E. et al., Proc. Natl. Acad. Sci. USA 85(20):7572-6
(1988)), and adenine phosphoribosyltransferase (aprt) (Lester, S. C. et al., Somatic
Cell. Genet. 6(2):241-59 (1980)) - HPRT, TK and aprt in positive selection schemes
require ES cells with mutations in their endogenous Hprt or TK loci - or may be a
DNA sequence conferring superior metabolic properties to the cells, as compared to
non-transfected (starting) cells, including the utilization of xanthine, adenine, etc.

According to the present invention the first and/or second marker are DNA
sequences encoding a protein allowing direct optical detection, such as by detection
of fluorescence, chemiluminescence or the like, or indirect optical detection, such as
a colorimetric assay or the like, provided that said second detectable marker differs
from said first detectable marker in order to allow separate detection. It is
particularly preferred that the first and/or second detectable marker peptide is a
fluorescence protein including, but not limited to, green fluorescence protein (GFP)
and homologous, analogous or color variants thereof, and Reef Coral Fluorescent
Proteins (RCFPs) such as AmCyan, ZsGreen, ZsYellow, DsRed, AsRed and HcRed
(Clontech, Palo Alto, CA). Particularly preferred are the fluorescent peptides
mentioned in the following Table 1 (Labas, Y.A. et al., PNAS 99(7):4256-4261
(2002)):



Table 1: Summary of family of GFP-like proteins 



Protein ID   Original ID   GenBank accession #   Genus species
amajGFP      amFP486       AF168421              Anemonia majano
dstrGFP      dsFP483       AF168420              Discosoma striata
clavGFP      cFP484        AF168424              Clavularia sp.
GFP          -             M62653                Aequorea victoria
cgigGFP      -             AY037776              Condylactis gigantea
hcriGFP      -             AF420592              Heteractis crispa
ptilGFP      -             AY015995              Ptilosarcus sp.
rmueGFP      -             AY015996              Renilla muelleri
zoanGFP      zFP560        AF168422              Zoanthus sp.
asulGFP      asFP499       AF322221              Anemonia sulcata
dis3GFP      -             AF420593              Discosoma sp.3
dendGFP      -             AF420591              Dendronephthya sp.
mcavGFP      -             AY037769              Montastraea cavernosa
rfloGFP      -             AY037772              Ricordea florida
scubGFP1     -             AY037767              Scolymia cubensis
scubGFP2     -             AY037771              Scolymia cubensis
zoanYFP      -             AF168423              Zoanthus sp.
DsRed        drFP583       AF168419              Discosoma sp.1
dis2RFP      dsFP593       AF272711              Discosoma sp.2
zoan2RFP     -             AY059642              Zoanthus sp.2
mcavRFP      -             AY037770              Montastraea cavernosa
rfloRFP      -             AY037773              Ricordea florida
asulCP       asCP          AF246709              Anemonia sulcata
hcriCP       hcCP          AF363776              Heteractis crispa
cgigCP       cpCP          AF363775              Condylactis gigantea
cpasCP       cpCP          AF383155              Condylactis passiflora
gtenCP       gtCP          AF383156              Goniopora tenuidens



It is moreover preferred that the positive selection marker or the second detectable
marker are flanked by RRSs in order to allow removal of the marker by means of a
recombinase after the proper recombination product was established.

The flanking homologous DNA segments may have a length of 0.1 to 20 kb,
preferably 0.5 to 10 kb. The actual flanking DNA sequences depend on the locus of
integration; suitable sequences are those homologous to the Rosa26, HPRT, beta-actin,
GAPDH locus or the like of eukaryotic cells.

In a particularly preferred vector of the invention the first detectable marker is 
ZsGreen and the constitutive promoter is the CAGGS promoter. The targeting 
cassette in such a vector may contain a positive selection marker as defined above 
or a second fluorescence marker differing from the first marker. Suitable marker 
combinations out of the ones mentioned in Table 1 are known in the art. Reef Coral
Fluorescent Proteins (RCFPs) such as AmCyan, ZsGreen, ZsYellow, DsRed, AsRed
and HcRed (Clontech, Palo Alto, CA) each emit a distinct wavelength. The
corresponding cDNAs were isolated from nonbioluminescent reef corals (class
Anthozoa), and have been optimized for bright emission, fast chromophore
maturation, and codon optimized for increased expression in mammalian cells. 
Because of their distinct spectra RCFPs can be used in combination to visualize 
multiple events simultaneously. 

In a most preferred vector the homologous sequences of the targeting cassette are
homologous to the mouse Rosa26 locus and the selection marker is a loxP flanked
neomycin resistance gene (see bp 1287 to 3233 of SEQ ID NO: 6), preferably said
vector has the sequence of SEQ ID NO: 6.

The method (1) of the invention comprises, but is not limited to, the following
steps:

1. Generation of a gene targeting construct harbouring one or more expression
cassettes coding for non-identical optical markers, e.g. two fluorescence
molecules (ZsGreen, AmCyan1 or ZsYellow1 or DsRed2, DsRedExpress or
AsRed2, or HcRed1; Clontech, Palo Alto, CA) under control of a constitutive
CAGGS promoter (Niwa, H. et al., Gene, 108(2):193-9 (1991)).

2. Gene targeting by homologous recombination in embryonic stem (ES) cells
with this vector.

3. Identification of preferentially homologous recombined ES clones harbouring
the second detectable marker, not harbouring the first detectable marker.

4. Isolation of such clones by

(a) manual picking of non-fluorescent clones under a fluorescence light emission
source;

(b) limited dilution plating of electroporated ES cells in multi-well cell culture dishes
at a pre-determined concentration to enable preferential growth of single
clones/well;

(i) manual sub-culture and molecular analysis of identified non-fluorescent
colonies by current protocols;

(ii) automated sub-culture and molecular analysis of identified
non-fluorescent colonies;

(c) automated cell sorting, preferably by image activated cell sorting, of
preferentially homologous recombined ES cells expressing or not expressing the
detectable fluorescent marker and plating of sorted cells at single cell density in
multi-well tissue culture plates.
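
For the limited dilution plating in step 4(b), the choice of seeding concentration can be reasoned about with a Poisson model. This model is a common assumption for limiting dilution but is not stated in this application. The sketch below uses hypothetical names and an illustrative mean of 0.3 clone-forming cells per well.

```python
import math


def well_probabilities(mean_clones_per_well: float) -> dict:
    """Poisson probabilities for a well holding 0, exactly 1, or >1 clones."""
    if mean_clones_per_well < 0:
        raise ValueError("mean must be non-negative")
    p_empty = math.exp(-mean_clones_per_well)
    p_single = mean_clones_per_well * p_empty
    return {"empty": p_empty, "single": p_single, "multiple": 1.0 - p_empty - p_single}


def clonal_fraction(mean_clones_per_well: float) -> float:
    """Fraction of wells with growth that contain exactly one clone."""
    p = well_probabilities(mean_clones_per_well)
    p_growth = 1.0 - p["empty"]
    if p_growth == 0.0:
        raise ValueError("no growth expected at a mean of zero")
    return p["single"] / p_growth


# Illustrative mean number of clone-forming cells per well (assumption).
mean = 0.3
probs = well_probabilities(mean)
print({name: round(value, 3) for name, value in probs.items()})
print(f"clonal fraction among wells with growth: {clonal_fraction(mean):.3f}")
```

Lower means give a higher clonal fraction at the cost of more empty wells. Given the low plating efficiency of single ES cells, the mean refers to clone-forming cells rather than to plated cells.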

Said automated sorting is preferably performed with one of the following devices A 
to C with the particular settings indicated:

(A) Cytocon™ 300 (Evotec Technologies) equipped with Cytocon™ Single Cell
Fraction Collector (Evotec Technologies). Parameters are preferably as follows:

I) Software: Cytocon™ software "SWITCH"

II) Cytocon™ 300 sorter chip mounted on an Olympus IX-50 fluorescence
microscope equipped with a 100 W lamp and fluorescein filter set (BP510-550,
DM570, BA590). Single cells are preferably caged within the field cage
using 5 V at 800 kHz.

III) Cytocon™ Fluidics system connected to the Cytocon™ 300 sorter chip,




consisting of

a) precision syringe pump
- rate pump 1: 1:1000-1500 pl/s
- rate pump 2: -488 pl/s
b) Fluidic block
c) Injector
a. Hamilton syringe, 5 - 10 µl injection volume

IV) Cytocon™ chip driver, settings as follows:

a. Mode Zero: 605

b. Mode One: 4.88

Detached fluorescent and non-fluorescent ES cells are preferably resuspended in an
isotonic buffered solution (Cytocon buffer II™) at a density of 1 x 10^5 cells/500 µl.
Preferably, to reduce electrolytes, buffering is achieved with carbohydrates such as,
but not limited to, glucose or inositol. Exposure to such buffer is preferably
minimized to less than 30 minutes, more preferably less than 20 minutes.
Fluorescence is assessed by eye (Olympus IX-50 fluorescence microscope equipped
with a 100 W lamp and fluorescein filter set (BP510-550, DM570, BA590); objective
10x/0.30 Ph1 UPlanFl).

Single cells showing no fluorescence are flushed out of the sorter chip with ES cell
culture medium and deposited with the aid of the Cytocon™ Single Cell Fraction Collector
(Evotec Technologies) into separate wells of a 96-well microtiter plate pre-filled with
ES cell culture medium with the following parameters: Sheath flow: 800000 pl/s
(0.8 µl/s).

(B) Elektra (Evotec Technologies)

Sorter chip mounted on an Olympus IX-50 fluorescence microscope equipped with
a 100 W lamp and fluorescein filter set (BP510-550, DM570, BA590). Single cells
are preferably caged within the field cage using 6 V, 700 kHz. Fluidics system
connected to the sorter chip, consisting of precision syringe pump (rates pump 1:
1:1000-1500 pl/s, pump 2: -488 pl/s). Injector with Hamilton syringe (10 µl
injection volume).

Detached fluorescent and non-fluorescent ES cells are preferably resuspended in an
isotonic buffered solution (Cytocon buffer II™) at a density of 1 x 10^5 cells/500 µl.
Preferably, to reduce electrolytes, buffering is achieved with carbohydrates such as,
but not limited to, glucose or inositol. Exposure to such buffer is preferably
minimized to less than 30 minutes, more preferably less than 20 minutes.
Fluorescence is assessed by eye (Olympus IX-50 fluorescence microscope equipped
with a 100 W lamp and fluorescein filter set (BP510-550, DM570, BA590); objective
10x/0.30 Ph1 UPlanFl).

Single cells showing no fluorescence are flushed out of the sorter chip with ES cell
culture medium and deposited into separate wells of a 96-well microtiter plate
pre-filled with ES cell culture medium with the following parameters: Sheath flow:
800000 pl/s (0.8 µl/s).

(C) Magnetic Bead Separation is preferably performed with the BD™ IMag Cell
Separation System (BD Bioscience), more preferably with the MACS® Technology
(Miltenyi Biotec) based on the use of MACS MicroBeads, MACS Columns and MACS
Separators; or separation is performed with the CliniMACS® Cell Selection System.

The method (2) of the invention for preparing transgenic tissues, organs and/or
multi-cell organisms (except for human beings or human tissues and organs, if such
are not patentable under the respective regulations) is exemplified by the following
steps:

(a) Generation of genetically modified organisms by

(aa) ES cells. Such cells can be used to create genetically modified rats or
mice by standard blastocyst injection technology (Robertson, E. J., Editor,
IRL Press: Oxford, UK, 71-112 (1987)), aggregation (Wood, S. A. et al.,
Nature 365(6441):87-89 (1993)), tetraploid blastocyst injection (Wang, Z.
Q. et al., Mech. Dev. 62(2):137-45 (1997)), nuclear transfer and cloning
(Wakayama, T. et al., Proc. Natl. Acad. Sci. USA 96(26):14984-9 (1999)), ES
cells derived from other organisms such as rabbits (Wang, Z. Q. et al., Mech.
Dev. 62(2):137-45 (1997); Schoonjans, L. et al., Mol. Reprod. Dev.
45(4):439-43 (1996)), chickens (Pain, B. et al., Development 122(8):2339-48
(1996)), or other species;

(bb) modified protoplasts to generate genetically modified plants;

(cc) nuclear transfer from modified eukaryotic cells to oocytes to generate
cloned organisms with modified allele (Wakayama, T. et al., Proc. Natl. Acad.
Sci. USA 96(26):14984-9 (1999); Baguisi, A. et al., Nat. Biotechnol.
17(5):456-61 (1999); Wilmut, I. et al., Nature 385(6619):810-3 (1997);
Wilmut, I. et al., Reprod. Fertil. Dev. 10(7-8):639-43 (1998); Wakayama,
T., Protein, Nucleic Acid Enzyme 45(13 Suppl.):2005-14 (2000); Wakayama,
T. et al., Nature 394(6691):369-74 (1998); Rideout, W. M., Nat. Genet.
24(2):109-10 (2000); Campbell, K. H. et al., Nature 380(6569):64-6
(1996));

(dd) cell-fusion to transfer the modified allele to another cell, including
transfer of engineered chromosome(s), and uses of such cell(s) to generate
organisms carrying the modified allele or engineered chromosome(s)
(Kuroiwa, Y. et al., Nat. Biotechnol. 18(10):1086-90 (2000)).

(b) Cloning: The techniques used to construct DNA vectors described herein are
standard molecular biology techniques well known to the skilled molecular biologist
(Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular Cloning: A Laboratory Manual,
2nd ed., vols. 1, 2 and 3, Cold Spring Harbor, NY, USA, Wiley Interscience, NY (1989)).

(c) Gene modifications including

(aa) deletion of coding sequences, gene segments, or regulatory elements;

(bb) alteration(s) of coding sequence, gene segments, or regulatory
elements including substitutions, additions, and fusions (e.g. epitope tags or
creation of bifunctional proteins);

(cc) insertion of new coding regions, gene segments, or regulatory elements,
such as those for selectable marker genes or reporter genes or putting new
genes under endogenous transcriptional control;

(dd) creation of conditional alleles, e.g. by introduction of loxP sites flanking
the region to be excised by Cre recombinase (Abremski, K., Hoess, R., J. Biol.
Chem. 259(3):1509-14 (1984)) or FRT sites flanking the region to be excised
by FLP recombinase (Andrews, B. J. et al., Cell 40(4):795-803 (1985);
Meyer-Leon, L. et al., Cold Spring Harb. Symp. Quant. Biol. 49:797-804
(1984)); or

(ee) replacement of coding sequences or gene segments from one species
with orthologous coding sequences from a different species, e.g. replacing a
murine genetic locus with the orthologous human genetic locus to engineer a
mouse where that particular locus has been 'humanized'.

(d) Introduction of vectors into eukaryotic cells: Using standard methodology, such
as transfection mediated by calcium phosphate, lipids, or electroporation
(Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular Cloning: A Laboratory Manual,
2nd ed., vols. 1, 2 and 3, Cold Spring Harbor, NY, USA, Wiley Interscience, NY (1989)).

(e) Selection: By exposure to selection agents, depending on the selectable marker
gene that has been engineered into the vector. If the selectable marker is the
neomycin phosphotransferase (neo) gene (Beck, E. et al., Gene 19(3):327-36
(1982)) then cells that have taken up the vector can be selected in G418-containing
media; cells that do not have the vector will die whereas cells that have taken up
the vector will survive (Freshney, R. I., Culture of animal cells, 3rd ed., Wiley &
Sons, Inc. (1994)). Other suitable selectable markers include any drug that has
activity in eukaryotic cells, such as hygromycin B (Santerre, R. F. et al., Gene
30(1-3):147-56 (1984); Bernard, H. U. et al., Exp. Cell. Res. 158(1):237-43 (1985);
Giordano, T. J., McAllister, W. T., Gene 88(2):285-8 (1990)), Blasticidin (Izumi, M.
et al., Exp. Cell. Res. 197(2):229-33 (1991)) and others which are familiar to those
skilled in the art.

The invention is further described by reference to the following example, which is,
however, not to be construed so as to limit the invention.

Example

As an example for optical identification of ES clones, the gene for the fluorescent
molecule ZsGreen (Clontech) was chosen. The Rosa locus of the mouse (Fig. 6, SEQ ID NO:3)
was chosen for homologous recombination. The targeting strategy is outlined in
Figures 3A, B, C and D.

1. Rosa Targeting Vector: A 129 Sv/Ev-BAC library (Incyte Genomics) was screened
with a probe against exon 2 of the Rosa26 locus (amplified from mouse genomic
DNA using Rscreen1s (GACAGGACAGTGCTTGTTTAAGG; SEQ ID NO:1) and
Rscreen1as (TGACTACACAATATTGCTCGCAC; SEQ ID NO:2)). PCR conditions were
as follows: 95°C, 2 min, followed by 30 cycles of 95°C, 30 s; 60°C, 30 s; 72°C, 30 s;
then 72°C, 7 min, followed by 20°C, 2 min. From the identified BAC clone, an 11 kb
EcoRV subfragment was inserted into the HindII site of pBS. Two fragments, a 1 kb
SacII/XbaI fragment (SEQ ID NO:4) and a 4 kb XbaI fragment (SEQ ID NO:5), were
used as homology arms and inserted into a vector consisting of a FRT-flanked
neomycin resistance gene and a PGK-TK-pA expression cassette for negative
selection (SEQ ID NO:13).
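The screening PCR described above can be summarized numerically. The following is a minimal sketch, not part of the disclosed method: it only restates the two primer sequences (SEQ ID NO:1 and SEQ ID NO:2) and the cycling conditions given in the text, and derives primer length, GC content and the total programmed hold time. The function names and the data layout are illustrative assumptions, and ramp times of the thermocycler are not considered.

```python
# Primer sequences as listed for SEQ ID NO:1 and SEQ ID NO:2.
PRIMERS = {
    "SEQ ID NO:1": "GACAGGACAGTGCTTGTTTAAGG",
    "SEQ ID NO:2": "TGACTACACAATATTGCTCGCAC",
}

# Cycling program from the text: (label, temperature in deg C, hold in s, repeats).
PCR_PROGRAM = [
    ("initial denaturation", 95, 120, 1),
    ("denaturation", 95, 30, 30),
    ("annealing", 60, 30, 30),
    ("extension", 72, 30, 30),
    ("final extension", 72, 420, 1),
    ("final hold", 20, 120, 1),
]


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a primer."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def total_hold_seconds(program) -> int:
    """Sum of programmed hold times; ramp times are not included."""
    return sum(hold * repeats for _, _, hold, repeats in program)


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_percent(seq):.1f}%")
    minutes = total_hold_seconds(PCR_PROGRAM) / 60
    print(f"Programmed hold time: {minutes:.0f} min")
```

Both primers are 23 nucleotides long, in agreement with the lengths stated in the sequence listing.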

2. pOMP1 and pOMP3 vector construction: For construction of pOMP1 (Fig. 8 and
SEQ ID NO:7), the FRT-flanked neomycin resistance gene was replaced by a
loxP-flanked neomycin resistance gene (SEQ ID NO:9). The ZsGreen gene was placed
under the control of the CAGGS promoter (SEQ ID NO:11) followed by a synthetic
intron (pMultilink CAGGS-ZsGreen; Fig. 10, SEQ ID NO:10). This expression
cassette was used to replace PGK-TK-pA of pOMP1, resulting in pOMP3
(Fig. 7 and SEQ ID NO:6). pOMP1 and pOMP3 were used for the targeting
experiments (Figures 3A, 3B). For ES cell electroporation, vectors were linearized at
the I-SceI restriction site.

3. Introduction of pOMP vectors into ES cells 

The ES cell line Art4.12 (Eggan, K. et al., Nat. Biotechnol. 20(5):455-9 (2002))
was grown on a mitotically inactivated feeder layer (Mitomycin C (Sigma M-0503))
comprised of mouse embryonic fibroblasts in medium composed of 1x DMEM high
Glucose (Invitrogen 41965-062), 4 mM Glutamin (Invitrogen 25030-024), 1x Non
Essential Amino Acids (Invitrogen 11140-035), 1 mM Sodium Pyruvat (Invitrogen
11360-039), 20 mM Hepes (Invitrogen 15630-056), 0.1 mM β-Mercaptoethanol
(Invitrogen 31350-010), 2 x 10^6 U/l Leukemia Inhibitory Factor (Chemicon ESG
1107) and 15% fetal bovine serum (PAN 1302-P220821).

Linearized vectors pOMP1 and pOMP3 were introduced into the cells by
electroporation using a Gene Pulser with Capacitance Extender (Biorad).
Rapidly growing cells were used on the first day following the last passage. Upon
trypsinization (Invitrogen 25200-056) cells were resuspended in PBS (Invitrogen
20012-019) and preplated for 25 min on gelatinized 10 cm plates to remove
unwanted feeder cells. The supernatant was harvested, ES cells were washed once
in PBS and counted (Neubauer hemocytometer). 10^7 cells were mixed with 30 µg of
I-SceI linearized vector in 800 µl of transfection buffer (20 mM Hepes, 137 mM NaCl,
15 mM KCl, 0.7 mM Na2HPO4, 6 mM Glucose, 0.1 mM β-Mercaptoethanol in H2O) and
electroporated using a Biorad Gene Pulser with Capacitance Extender set on 250 V
and 500 µF. Electroporated cells were seeded at a density of 0.25 x 10^7 cells per 10 cm
tissue culture dish onto a previously prepared layer of neomycin-resistant
inactivated mouse embryonic fibroblasts (MEF). For automated selection of
non-fluorescent clones (experiment 6), cells were seeded on 6 cm gelatine-coated tissue
culture dishes without MEF layer.
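The quantities in the electroporation step translate into a few derived figures that are convenient when scaling the protocol. The sketch below is illustrative only and assumes the values read from the text above (10^7 cells, 30 µg vector, 800 µl buffer, 0.25 x 10^7 cells per 10 cm dish); the constant and function names are hypothetical.

```python
import math

# Values as read from the electroporation protocol (assumed readings).
CELLS_PER_ELECTROPORATION = 1e7   # ES cells per cuvette
VECTOR_UG = 30.0                  # linearized vector, in micrograms
BUFFER_UL = 800.0                 # transfection buffer, in microliters
CELLS_PER_DISH = 0.25e7           # seeding density per 10 cm dish


def dna_concentration_ug_per_ml(vector_ug: float, buffer_ul: float) -> float:
    """Vector concentration in the cuvette in micrograms per milliliter."""
    return vector_ug / (buffer_ul / 1000.0)


def cell_density_per_ml(cells: float, buffer_ul: float) -> float:
    """Cell density in the cuvette in cells per milliliter."""
    return cells / (buffer_ul / 1000.0)


def dishes_needed(cells: float, cells_per_dish: float) -> int:
    """Number of 10 cm dishes required to seed all electroporated cells."""
    return math.ceil(cells / cells_per_dish)


if __name__ == "__main__":
    print(dna_concentration_ug_per_ml(VECTOR_UG, BUFFER_UL), "ug/ml vector")
    print(cell_density_per_ml(CELLS_PER_ELECTROPORATION, BUFFER_UL), "cells/ml")
    print(dishes_needed(CELLS_PER_ELECTROPORATION, CELLS_PER_DISH), "dishes")
```

Under these assumptions one electroporation is distributed over four 10 cm dishes.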

4. Manual identification of ES cells containing a targeted disruption of the Rosa
locus

48 h after electroporation, the medium was replaced on all dishes by medium
containing 250 µg/ml Geneticin (Invitrogen 10131-019) for positive selection of
G418 resistant ES clones.

5 days after electroporation, half of the dishes containing ES cells electroporated with
vector pOMP1 were negatively selected by addition of 2 µM Gancyclovir (Cymeven®,
Roche).

On day 8 after electroporation ES colonies were isolated as follows:
Medium was replaced by PBS and the culture dishes were placed on the stage of a
binocular (Nikon SMZ-2B). Using low magnification (25x) individual ES clones of
undifferentiated appearance were removed from the surface of the culture dish by
suction into the tip of a 20 µl pipette (Eppendorf). The harvested clones were
placed in individual wells of 96-well plates containing 30 µl of 2.5% Trypsin
(Invitrogen). After disruption of clones by pipetting with a multichannel pipette
(Eppendorf), cells were seeded onto feeder-containing 96-well plates with
pre-equilibrated complete ES medium.

For pOMP1, positive selected clones (Geneticin) and positive/negative selected
clones (Geneticin and Gancyclovir) were harvested as described above.
For pOMP3, only positive selected clones were isolated, either at random as
described above, or the ES clones were placed under a binocular (Leica MZFL III)
with fluorescent light source (GFP3 filter set) and bright-field illumination. At dim
light, clearly identifiable non-fluorescent ES clones (Figure 4) were isolated as
described above.

Cells were grown for 3 days with daily medium changes and then split 1:2 on
gelatinized (Sigma G-1890) 96-well plates. 3 days after splitting cells were lysed,
genomic DNA was prepared and analysed by Southern blot.

5. Identification of non-fluorescent transfected cells by plate-reader analysis

Electroporated cells were seeded at a density of 1 cell per well in a 96 multi-well
plate (Nunc, Order #136101), coated with a previously prepared layer of
neomycin-resistant inactivated mouse embryonic fibroblasts.

48 h after electroporation, the medium was replaced on all dishes by medium
containing 250 µg/ml Geneticin (Invitrogen 10131-019) for positive selection of
G418 resistant ES clones.

5 days after transfection individual wells were scored by plate-reader analysis

a) by O.D. measurement for the presence of single ES clones

b) by fluorescence measurement (GFP3 filter set; Leica, Bensheim, Germany)
for the location of non-fluorescent clones.

Identified non-fluorescent ES clones were expanded and taken to molecular analysis
by Southern blotting (see Fig. 3A, 3B) according to standard procedures.
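The two-step plate-reader scoring (O.D. for the presence of a clone, fluorescence for the marker) amounts to a simple per-well classification. The following is a minimal sketch under stated assumptions: the text gives no cut-off values, so both thresholds, the well labels and the example readings are hypothetical placeholders that would have to be calibrated against empty and fluorescent control wells.

```python
from typing import Dict, List, Tuple


def classify_well(od: float, fluorescence: float,
                  od_threshold: float, fluorescence_threshold: float) -> str:
    """Classify one well from its O.D. and fluorescence readings.

    A well below the O.D. threshold is treated as containing no clone;
    otherwise the fluorescence reading decides between the two classes.
    """
    if od < od_threshold:
        return "empty"
    if fluorescence >= fluorescence_threshold:
        return "fluorescent"
    return "non-fluorescent"


def non_fluorescent_wells(readings: Dict[str, Tuple[float, float]],
                          od_threshold: float,
                          fluorescence_threshold: float) -> List[str]:
    """Return the sorted labels of wells holding a non-fluorescent clone."""
    return sorted(
        well for well, (od, fl) in readings.items()
        if classify_well(od, fl, od_threshold, fluorescence_threshold)
        == "non-fluorescent"
    )


if __name__ == "__main__":
    # Hypothetical readings: well -> (O.D., fluorescence in arbitrary units).
    example = {"A1": (0.05, 10.0), "A2": (0.40, 900.0), "A3": (0.35, 15.0)}
    print(non_fluorescent_wells(example, od_threshold=0.2,
                                fluorescence_threshold=100.0))
```

Wells returned by `non_fluorescent_wells` correspond to the candidates that would be expanded and analysed by Southern blotting.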

6. Image Activated Cell Sorting for the Analysis of Transfected Cells and Single Cell 
Plating 

48 h after electroporation, the medium was replaced on all dishes by medium
containing 250 µg/ml Geneticin (Invitrogen 10131-019) for positive selection of
G418 resistant ES clones.

On day 7 or 8 after electroporation ES colonies were automatically sorted by
Cytocon™ 300 (Evotec Technologies) based on the presence or absence of
fluorescent light emission of single clones. Non-fluorescent ES cells were
automatically plated employing a Cytocon™ Single Cell Fraction Collector (Evotec
Technologies) in single entities of a 96-well plate.

Transfected ES cells were enzymatically detached by trypsin addition
(Invitrogen 25200-056). After centrifugation, cells were resuspended in Cytocon
buffer II at a density of 1 x 10^5 cells/500 µl. Five to ten µl of cells diluted 1:5 in
buffer II were applied to the Cytocon™ 300 using a Hamilton syringe. Sorting and single
cell plating were undertaken with the following Cytocon™ 300 settings: Generator
modes 0: 605; 1: 4.88; pump rates pump 1: 1:1000-1500 pl/s; pump 2: -488 pl/s;
"sheath flow": 800000 pl/s (0.8 µl/s). A Cytocon™ Sorter Chip was mounted on a
fluorescence microscope and single cells were caged within the field cage.
Fluorescence was assessed by eye and non-fluorescent cells were automatically
seeded with the aid of the Cytocon™ Single Cell Fraction Collector (Evotec
Technologies) into single entities (individual wells of a 96-well tissue culture plate
(Greiner Cellstar), coated with MEF cells and pre-equilibrated complete ES
medium).
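The flow settings above are given in picoliters per second, with the sheath flow also stated in microliters per second. The short sketch below only illustrates the unit conversion behind the "800000 pl/s (0.8 µl/s)" figure; the function names are hypothetical and nothing here models the instrument itself.

```python
PL_PER_UL = 1_000_000  # picoliters per microliter


def pl_per_s_to_ul_per_s(rate_pl_per_s: float) -> float:
    """Convert a flow rate from picoliters/second to microliters/second."""
    return rate_pl_per_s / PL_PER_UL


def pl_per_s_to_ul_per_min(rate_pl_per_s: float) -> float:
    """Convert a flow rate from picoliters/second to microliters/minute."""
    return rate_pl_per_s * 60 / PL_PER_UL


if __name__ == "__main__":
    sheath_flow = 800_000  # pl/s, as stated in the settings
    print(pl_per_s_to_ul_per_s(sheath_flow), "ul/s")
    print(pl_per_s_to_ul_per_min(sheath_flow), "ul/min")
```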

Clones were grown for 7 days with one medium change and then replated and 
cultured 1:1 onto a second 96-well tissue culture plate. Clones were subsequently 
expanded, aliquots cryoconserved and genomic DNA prepared and molecularly
analysed by Southern blot.

7. Injection of ES cells into diploid and tetraploid blastocysts and generation of
chimeric and ES mice

Embryo culture was carried out in microdrops on standard bacterial petri dishes
(Falcon) under mineral oil (Sigma). Modified CZB media (Chatot et al, Supra (?))
was used for embryo culture unless otherwise noted. Hepes buffered CZB was used
for room temperature operations.

Chimeras were generated as described (Hogan, B., Beddington, R., Costantini, F. &
Lacy, E. eds. Manipulating the Mouse Embryo, a Laboratory Manual, 2nd ed. Cold
Spring Harbor, New York: Cold Spring Harbor Laboratory Press, 1994). Briefly,
Balb/c host embryos were harvested at dpc 3.5 from the uterus of superovulated
Balb/c OlaHsd females (Harlan Netherlands) mated with Balb/c OlaHsd males
(Harlan Netherlands). For microinjection, 5-6 blastocysts were placed in a drop of
DMEM with 15% FCS under mineral oil. A flat tip, piezo actuated microinjection
pipette with an internal diameter of 12-15 µm was used to inject 20-30 ES cells
into each blastocyst. After recovery, 8 injected blastocysts were transferred to each
uterine horn of 2.5 days post coitum, pseudopregnant NMRI females that had been
mated with vasectomized males. Litters were controlled, and pups alive by that
time were counted as surviving pups. ES cell contribution was judged by coat color
chimerism.

Production of mice by tetraploid embryo complementation has been previously
described (Eggan et al., PNAS 98: 6209-6214 (2001)). After administration of
hormones, superovulated B6D2F1 females were mated with B6D2F1 males.
Fertilized zygotes were isolated from the oviduct and any remaining cumulus cells
were removed with hyaluronidase. After overnight culture, two-cell embryos were
electrofused to produce one-cell tetraploid embryos using a CF150-B cell fusion
instrument from BLS (Budapest, Hungary) according to the manufacturer's
instructions. Embryos that had not undergone membrane fusion within 1 hour were
discarded. Embryos were then cultured in vitro to the blastocyst stage. For
microinjection, 5-6 blastocysts were placed in a drop of DMEM with 15% FCS under
mineral oil. A flat tip, piezo actuated microinjection pipette with an internal
diameter of 12-15 µm was used to inject 15 ES cells into each blastocyst. After
recovery, ten injected blastocysts were transferred to each uterine horn of 2.5 days
post coitum, pseudopregnant NMRI females that had been mated with
vasectomized males. Litters were controlled, and pups alive by that time were
counted as surviving pups.

8. Results 

(a) Effect of fluorescence on ES cell growth 

Expression of ZsGreen (pOMP3) did not negatively influence ES cell colony
formation. The mean number of resistant ES colonies per 1 x 10^7 ES cells transfected
with the ZsGreen expression vector pOMP3 is similar to the number of ES colonies
transfected with pOMP1 (see Table 2).

Table 2: Fluorescence does not negatively affect clonal ES cell growth

Construct   2nd marker gene    2nd selection   # clones per 1 x 10^7 transfected cells
pOMP1       None               none            3500
pOMP1       Thymidine Kinase   Gancyclovir     950
pOMP3       ZsGreen            none            3366



(b) Selective isolation of non-fluorescent ES colonies. 

15,5% of all colonies in the pOMP3 gene targeting experiment were non-fluorescent.
Isolation of non-fluorescent ES colonies under a fluorescent light source
as described above led to significant enrichment of such ES colonies. By this
method 99,3% (all but 1 clone) were non-fluorescent when monitored upon
isolation (see Table 3).

Table 3: Enrichment of isolation of non-fluorescent ES colonies in pOMP3
experiments.

Isolation method      Colony type       Isolated (%)
Random isolation      Fluorescent       84,5
Random isolation      Non-fluorescent   15,5
Selective isolation   Fluorescent       0,7
Selective isolation   Non-fluorescent   99,3



(c) HR frequency in pOMP targeting experiments 

The analysis of non-fluorescent ES cells for HR events in the case of pOMP3
targeting resulted in a targeting frequency of 6,49% +/- 0,5% in 3 independent
experiments (Table 4).

This is an 8-fold higher frequency compared to targeting of pOMP1 without
counter-selection. Compared to pOMP1 gene targeting plus counter-selection with
Gancyclovir (3,02% HR), optical isolation was 2,2-fold more efficient.

Table 4: Enrichment of gene targeting frequency in non-fluorescent ES clones (see
also Fig. 5)

Construct   Selection         HR clones/analyzed   HR frequency (%)
pOMP1       None              6/708                0,85
pOMP1       TK/GanC           10/331               3,02
pOMP3       fluorescent       0/452                0,00
pOMP3       Non-fluorescent   16/239               6,69
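The frequencies and fold enrichments quoted for Table 4 follow directly from the clone counts. As a worked check, the sketch below recomputes them from the pooled counts in the table (HR clones / analyzed clones); the dictionary keys and function names are illustrative assumptions, and the pooled values need not coincide exactly with a mean over independent experiments.

```python
# Pooled counts from Table 4: (HR clones, clones analyzed).
TABLE_4 = {
    ("pOMP1", "none"): (6, 708),
    ("pOMP1", "TK/GanC"): (10, 331),
    ("pOMP3", "fluorescent"): (0, 452),
    ("pOMP3", "non-fluorescent"): (16, 239),
}


def hr_frequency_percent(hr_clones: int, analyzed: int) -> float:
    """Homologous recombination frequency in percent."""
    return 100.0 * hr_clones / analyzed


def fold_enrichment(test: tuple, reference: tuple) -> float:
    """Ratio of the HR frequency of `test` to that of `reference`."""
    return hr_frequency_percent(*test) / hr_frequency_percent(*reference)


if __name__ == "__main__":
    for (construct, selection), counts in TABLE_4.items():
        freq = hr_frequency_percent(*counts)
        print(f"{construct:6s} {selection:16s} {freq:5.2f} %")
    optical = TABLE_4[("pOMP3", "non-fluorescent")]
    print("vs. no counter-selection:",
          round(fold_enrichment(optical, TABLE_4[("pOMP1", "none")]), 1))
    print("vs. TK/GanC:",
          round(fold_enrichment(optical, TABLE_4[("pOMP1", "TK/GanC")]), 1))
```

The computed ratios (about 7.9 and 2.2) agree with the roughly 8-fold and 2,2-fold improvements stated in the text.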



(d) HR frequency in Automated pOMP targeting experiments 

Transfected, sorted and individually seeded cells attached and divided at a
frequency of 34%. 25 individual clones were analyzed and HR clones
identified at a frequency of 4% (Table 5).

Table 5: Frequency of gene targeting in FACS sorted and single-cell plated ES
clones.

Construct   Single cell plating efficiency   HR clones/analyzed   HR frequency
pOMP3       34%                              1/25                 4%



(e) The automated gene targeting procedure does not affect the competence for mouse
production of wild-type "Art4.12" ES cells and HR ES clone "OMP 3 Cyto 1-B6".
Cytocon™ 300 sorted ES cells produce highly coat colour chimeric mice (see Table
6), and, via tetraploid complementation, solely ES clone derived mice (see Table 7).

Table 6: Production of chimeric mice by diploid injection.

                     Art 4.12 Parental   Targeted ES clone OMP 3 Cyto 1-B6
# Injected diploid   61                  52
# Pups born          14                  13
# Chimeric pups      11                  13
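To compare the parental line with the sorted, targeted clone, the counts of Table 6 can be turned into rates. This is a minimal sketch using only the numbers in the table; the two derived ratios (pups born per injected blastocyst, chimeric pups per pup born) and all names are illustrative choices, not measures defined in the text.

```python
# Counts from Table 6: line -> (injected diploid blastocysts, pups born, chimeric pups).
TABLE_6 = {
    "Art 4.12 parental": (61, 14, 11),
    "OMP 3 Cyto 1-B6": (52, 13, 13),
}


def birth_rate_percent(injected: int, pups_born: int) -> float:
    """Pups born per injected blastocyst, in percent."""
    return 100.0 * pups_born / injected


def chimera_rate_percent(pups_born: int, chimeric: int) -> float:
    """Chimeric pups per pup born, in percent."""
    return 100.0 * chimeric / pups_born


if __name__ == "__main__":
    for line, (injected, born, chimeric) in TABLE_6.items():
        print(f"{line}: {birth_rate_percent(injected, born):.1f} % born, "
              f"{chimera_rate_percent(born, chimeric):.1f} % of pups chimeric")
```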




Table 7: Production of ES mice by tetraploid injection.

                        Art 4.12 Parental   Targeted ES clone OMP 3 Cyto 1-B6
# Injected tetraploid   153                 60
# ES mice born          1                   3

Claims

1. A method for isolation of primary mammalian cells after homologous recombination
comprising

(a) transfecting starting primary cells with a homologous targeting vector
comprising

(A) a homologous targeting cassette which comprises (i) a functional DNA
segment and a positive selection marker or a functional DNA segment and a first
detectable marker; and (ii) two DNA segments homologous to the integration
site within the genome of the primary mammalian cells flanking (i); and

(B) an expression cassette harboring a DNA sequence coding for a second
detectable marker different from said first detectable marker, said expression
cassette (B) being connected with said homologous targeting cassette (A) so as
to allow distinction between targeted and non-targeted cells;

(b) manual or automatic identification and/or isolation of cells containing the
positive selection marker or the first detectable marker; and

(c) manual or automatic identification and/or isolation of cells as homologous
recombinants by the absence of the second detectable marker.

2. The method of claim 1, wherein

(i) the expression cassette (B) comprises a gene coding for said second detectable
marker under the control of a promoter active in primary mammalian cells,
preferably the promoter is a constitutive promoter; and/or

(ii) said first and/or second detectable marker is a non-toxic, directly or indirectly
detectable compound, preferably is a compound being detectable by colorimetric,
fluorescence, chemiluminescence, phosphorescence detection methods, is a
membrane-bound protein or a protein with membrane anchoring signal sequence or
the like, most preferably is a fluorescent peptide; and/or

(iii) the expression cassette (B) is positioned 5' or 3' relative to the homologous
targeting cassette; and/or

(iv) the primary mammalian cells are pluripotent or totipotent cells, preferably are
ES cells; and/or

(v) the primary mammalian cells are derived from a rodent such as mouse, rat, etc.,
preferably the primary mammalian cells are mouse ES cells.

3. The method of claim 1 or 2, wherein

(i) the functional DNA segment of the targeting cassette (A) is a DNA sequence
encoding a gene of interest, or is a functional DNA sequence which can be
converted into such DNA sequence encoding a gene of interest, or intronic
sequence, or is a regulatory functional DNA sequence, including, but not limited to,
splice receptor of splice donor or acceptor sequences, and recombinase recognition
site(s); and/or

(ii) the functional DNA segment may further comprise gene expression control
elements including, but not limited to, ubiquitous or tissue specific promoter, either
constitutive or inducible, a polyadenylation signal, intron sequences, recombinase
recognition site(s) (RRS), enhancer recognition site(s), and matrix attachment
region(s) (MAR); and/or

(iii) the positive selection marker is a DNA sequence encoding a protein conferring
resistance against cell poison, including, but not limited to, neomycine,
hygromycine, puromycine, histidinol and bleomycine, is a DNA sequence conferring
superior metabolic properties to the cells, including the utilization of xanthine,
adenine etc., or the like; and/or

(iv) the first and/or second detectable marker are DNA sequences encoding a
protein allowing direct optical detection, such as by detection of fluorescence,
chemiluminescence or the like, or indirect optical detection, such as a colorimetric
assay or the like, provided that said second detectable marker differs from said first
detectable marker in order to allow separate detection; and/or

(v) the second detectable marker is a membrane-bound protein or a protein with
membrane anchoring signal sequence; and/or

(vi) the positive selection marker and the first detectable marker are flanked by one
or more RRSs; and/or

(vii) the flanking homologous DNA segments have a length of 0.1 to 20 kb,
preferably 0.5 to 10 kb; and/or

(viii) the flanking DNA sequences are homologous to the Rosa26, HPRT, beta-actin,
GAPDH locus of the eukaryotic cells.



4. The method of claim 2 or 3, wherein

(i) the first and/or second detectable marker peptide is a fluorescence protein
including, but not limited to, green fluorescence protein (GFP) and modifications
thereof, the constitutive promoter of the expression cassette is selected from
CAGGS, CMV, PGK, TK, preferably the fluorescence protein is ZsGreen and the
constitutive promoter is the CAGGS promoter; and/or

(ii) the first and/or second detectable marker is a protein including, but not limited
to, CD1a, CD2, CD3, CD4, CD5, CD6, CD8, CD11, CD14, CD15, CD16, CD19, CD20,
CD22, CD25, CD27, CD30, CD33, CD34, CD43, CD45, CD56, CD61, CD62, CD69,
CD71, CD90, CD105, CD117, CD123, CD133, CD138, BDCA-2, BDCA-3, BDCA-4,
CRTH2, ErbB-2, and Ep-CAM (Human Epithelial Antigen HEA) and preferably the
positive selection marker is neomycine and the second selectable marker is selected
from ZsGreen, CD4 and CD8, or the first and second selectable markers are
ZsGreen and HcRed.

5. The vector of claim 4, wherein the homologous sequences are homologous to the
mouse Rosa26 locus, and preferably said vector has the sequence of SEQ ID NO:6.

6. The method according to any one of claims 1 to 5, wherein the identification or
isolation is performed automatically, preferably

(i) the isolation being effected by fluorescence activated cell sorting (FACS);
and/or

(ii) the isolation buffer is an isotonic buffered solution, preferably for reduction of
electrolytes buffering is achieved with, but not limited to, carbohydrates such as
glucose or inositol; and/or

(iii) the cells to be sorted remain in the buffer for a time shorter than 30 minutes,
preferably shorter than 20 minutes.

7. The method according to any one of claims 1 to 6 which further comprises

(i) plating of transfected cells into standard tissue culture vessels; and/or

(ii) limited dilution plating of transfected cells into multiwell plates, including
96-well, 384-well and 1536-well plates, in order to allow preferential growth and
analysis of single clones in individual wells/plate, and optionally

(iii) subsequent detection and isolation of transfected cells plated as described
above by manually or automatically selecting for the absence of the first detectable
marker and/or the presence of the second detectable marker.

8. The method of claim 7 wherein the presence of the first and/or absence of the
second marker is analyzed by a FACS mechanism, whereby the isolated sorted cells
are re-plated as described in claim 7.

9. A method for preparing transgenic tissues, organs and/or multi-cell organisms
which comprises utilizing the method as defined in claims 1 to 8.

10. The method of claim 9, wherein the transgenic multi-cell organism is a
non-human mammal, most preferably mouse, and said method comprises modifying an
ES cell as defined in claims 1 to 9.

11. The method of claim 10 which further comprises one or more of the steps

(i) injecting isolated ES cells in or aggregating isolated ES cells with diploid and/or
multiploid preimplantation embryos, preferably blastocysts, or

(ii) injecting in/fusion of nuclei from ES cells with enucleated non-fertilized eggs
(nuclear transfer); and/or

(iii) generating transgenic non-human embryos and/or animals.

12. A vector for targeted homologous recombination of eukaryotic cells as defined 
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' SEQUENCE LISTING 
<110> Artemis Pharmaceuticals GmbH 

<120> Automated Gene-Targeting Using Non-Toxic Detectable 
Markers : 

<130> 040104wo JH/BM : . 

<140> 
<141> 

<150> 03002424 . 4 
<151> 2003-03-05 • 

<150> US 60/446,921 
<151> 2003-03-12 

<160> 13 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 23 
<212> DNA ' 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
Rsceenls 

<400> 1 

gacaggacag tgcttgttta agg . . 

<210> 2 
<211> 23 
<212> 'DNA 

<213> Artificial Sequence - 
<220> 

<223> Description of Artificial Sequence: Primer 
Rscreenlas 

<400> 2 

tgactacaca atattgctcg cac 



<210> 3 
<211> 13139 
<212> DMA 

<213> Mus musculus 



<220> 

<223> Description: Rosa 26 locus 
<400> 3 

aagcttctca cgtagcaacc agagctccag 

actcctgtga tccaacacag gagcaacctt 

cttttttttg gggggggggg gggaacaagt 

tttagaaagg aacaatccta aggtcacttt 

aaataccaaa ttccaaatat .tcgttttaat 

ttttagtttt aaaaaggtca attttccaaa 

, aggaacaaga g.ccttgggtc atgtcctacc 

caatctgaat aaccttaaag cctctaatcc 

ctaatgatcc tcctgagaaa actggaagaa 

aaactcagaa atctctaatt atgtcagtat 



sequence 



agccagcagc tgctgccgcc ttgtatactc 60 
ttctttaccc. cacccccact tcttaacaca 120 
gctccatgct ggaaggattg gaactatgct 180 
taaattgagg tctttgattt gaaaatcaac 240 
taaaccagca atgtggatat. aagcattaag 300 
.cattcagcaa tcatatttaa atttacagct 360 
aaagaacata actcaatatt ctacacatga 420 
cataacaggc cacaaatttt ggacagagaa '480 
atccagggaa aagaaattcc tgtgtcctcc 540 
tctctgcttt agtcctaggt cagattgcac 600 
acatctaaaa taacctctta aagttttcct cctagcgacc taaaccatta ttaatatcaa 660
attaaccatc aaaacacttt cctctcaata tgctgcacac aaacctcctc ctggaacctc 720
ctccatctgg atcctcccca atcaaaagta taggtattta acatataagc aaggaagtaa 780
tgtaaacatg accttggtca caaatatgtc atctaaaaac aatttagtca aggtatggag 840
gaaattcgag aacctgaatc tttttaagta ttttgagcac aggaacaatt ggcaaaagga 900
atccaggtat agacaaaacc cagagcccag agctctgggc gaaaaatgag ttgctggtga 960
agacgttaca caagtaacat gagaaagcag aaaatgcagg tcatccacgc acccctgacc 1020
caggccagca gggcgggctg cagcatcagt acacaggaga aagatcctta ttcctaagaa 1080
tgagaaaggc aaaggcgccc gatagaataa attagcatag aaggggcttt cccaggagtt 1140
aaaactttcc ttctgagcga ttacctacta aaaccagggc ttttgcccac taccatttac 1200
ctaggatctt ggcttgcacg gattcatagg ggcatatccc tccccctctt ctttagagtc 1260
gttcttaaaa gatcgctctc cacgccctag gcagggaaaa cgacaaaatc tggctcaatt 1320
ccaggctaga accctacaaa ttcaacaggg atatcgcaag gatactgggg catacgccac 1380
agggagtcca agaatgtgag gtgggggtgg cgaaggtaat gtctttggtg tgggaaaagc 1440
agcagccatc tgagatagga actggaaaac cagaggagag gcgttcagga agattatgga 1500
ggggaggact gggcccccac gagcgaccag agttgtcaca aggccgcaag aacaggggag 1560 
gtggggggct cagggacaga aaaaaaagta tgtgtatttt gagagcaggg ttgggaggcc 1620 
tctcctgaaa agggtataaa cgtggagtag gcaataccca ggcaaaaagg ggagaccaga 1680 
gtagggggag gggaagagtc ctgacccagg gaagacatta aaaaggtagt ggggtcgact 1740
agatgaagga gagcctttct ctctgggcaa gagcggtgca atggtgtgta aaggtagctg 1800 
agaagacgaa aagggcaagc atcttcctgc taccaggctg gggaggccca ggcccacgac 1860 
cccgaggaga gggaacgcag ggagactgag gtgacccttc tttcccccgg ggcccggtcg 1920 
tgtggttcgg tgtctctttt ctgttggacc cttaccttga cccaggcgct gccggggcct 1980 
gggcccgggc tgcggcgcac ggcactcccg ggaggcagcg agactcgagt taggcccaac 2040 
gcggcgccac ggcgtttcct ggccgggaat ggcccgtacc cgtgaggtgg gggtgggggg 2100
cagaaaaggc ggagcgagcc cgagcgggga gggggagggc caggggcgga gggggccggc 2160 
actactgtgt tggcggactg gcgggacta^ ggotgcgtga gtctctgagc gcaggcgggc 2220 
ggcggccgcc cctcccccgg cggcggcagc ggcggcagcg gcggcagctc actcagcccg 2280 
ctgcccgagc ggaaacgcca ctgaccgcac ggggattccc agtgccggcg ccaggggcac 2340
gcgggacacg ccccctcccg ccgcgccatt ggcctctccg cccaccgccc cacacttatt 2400
ggccggtgcg ccgcpaatca gcggaggctg ccggggccgc ctaaagaaga ggctgtgctt 2460
tggggctccg gctcctcaga gagcctcggc taggtagggg atcgggactc tggcgggagg 2520
gcggcttggt gcgtttgcgg ggatgggcgg ccgcggcagg ccctccgagc gtggtggagc 2580
cgttctgtga gacagccggg tacgagtcgt gacgctggaa ggggcaagcg ggtggtgggc 2640
aggaatgcgg tccgccctgc agcaaccgga gggggaggga gaagggagcg gaaaagtctc 2700
caccggacgc ggccatggct cggggggggg ggggcagcgg aggagcgctt ccggccgacg 2760
tctcgtcgct gattggcttc ttttcctccc gccgtgtgtg aaaacacaaa tggcgtgttt 2820 
tggttggcgt aaggcgcctg tcagttaacg gcagccggag tgcgcagccg ccggcagcct 2880 
cgctctgccc actgggtggg gcgggaggta ggtggggtga ggcgagctgg acgtgcgggc 2940 
gcggtcggcc tctggcgggg cgggggaggg gagggagggt cagcgaaagt agctcgcgcg 3000 
cgagcggccg cccaccctcc ccttcctctg ggggagtcgt tttacccgcc gccggccggg 3060
cctcgtcgtc tgattggctc tcggggccca gaaaactggc ccttgccatt ggctcgtgtt 3120
cgtgcaagtt gagtccatcc gccggccagc gggggcggcg aggaggcgct cccaggttcc 3180
ggccctcccc tcggccccgc gccgcagagt ctggccgcgc gcccctgcgc aacgtggcag 3240
gaagcgcgcg ctgggggcgg ggacgggcag tagggctgag cggctgcggg gcgggtgcaa 3300 
gcacgtttcc gacttgagtt gcctcaagag gggcgtgctg agccagacct ccatcgcgca 3360 
ctccggggag tggagggaag gagcgagggc tcagttgggc tgttttggag gcaggaagca 3420 
cttgctctcc caaagtcgct ctgagttgtt atcagtaagg gagctgcagt ggagtaggcg 3480 
gggagaaggc cgcacccttc tccggagggg ggaggggagt gttgcaatac ctttctggga 3540 
gttctctgct gcctcctggc ttctgaggac cgccctgggc ctgggagaat cccttccccc 3600 
tcttccctcg tgatctgcaa ctccagtctt tctagaagat gggcgggagt cttctgggca 3660 
ggcttaaagg ctaacctggt gtgtgggcgt tgtcctgcag gggaattgaa caggtgtaaa 3720 
attggaggga caagacttcc cacagatttt cggttttgtc gggaagtttt ttaatagggg 3780 
caaataagga aaatgggagg ataggtagtc atctggggtt ttatgcagca aaactacagg 3840 
ttattattgc ttgtgatccg cctcggagta ttttccatcg aggtagatta aagacatgct 3900 
cacccgagtt ttatactctc ctgcttgaga tccttactac agtatgaaat tacagtgtcg 3960 
cgagttagac tatgtaagca gaattttaat catttttaaa gagcccagta cttcatatcc 4020
atttctcccg ctccttctgc agccttatca aaaggtattt tagaacactc attttagccc 4080
cattttcatt tattatactg gcttatccaa cccctagaca gagcattggc attttccctt 4140
tcctgatctt agaagtctga tgactcatga aaccagacag attagttaca tacaccacaa 4200
atcgaggctg tagctggggc ctcaacactg cagttctttt ataactcctt agtacacttt 4260
ttgttgatcc tttgccttga tccttaattt tcagtgtcta tcacctctcc cgtcagtggt 4320
gttccacatt tgggcctatt ctcagtccag ggagttttac aacaatagat gtattgagaa 4380
tccaacctaa agcttaactt tccactccca tgaatgcctc tctccttttt ctccatttat 4440
aaactgagct attaaccatt aatggttcca ggtggatgtc tcctccccat attacctgat 4500
gtatcttaca tattgccagg ctgatatttt aagacattaa aaggtatatt tcattattga 4560
gccacatggt attgattact gcttactaaa attttgtcat tgtacacatc tgtaaaaggt 4620
ggttcctttt ggaatgcaaa gttcaggtgt ttgttgtctt tcctgaccta aggtcttgtg 4680
agcttgtatt ttttctattt aagcagtgct ttctcttgga ctggcttgac tcatggcatt 4740 
ctacacgtta ttgctggtct aaatgtgatt ttgccaagct tcttcaggac ctataatttt 4800 
gcttgacttg tagccaaaca caagtaaaat gattaagcaa caaatgtatt tgtgaagctt 4860
ggtttttagg ttgttgtgtt gtgtgtgctt gtgctctata ataatactat ccaggggctg 4920
gagaggtggc tcggagttca agagcacaga ctgctcttcc agaagtcctg agttcaattc 4980
ccagcaacca catggtggct cacaaccatc tgtaatggga tctgatgccc tcttctggtg 5040
tgtctgaaga ccacaagtgt attcacatta aataaataaa tcctccttct tcttcttttt 5100
ttttttttta aagagaatac tgtctccagt agaatttact gaagtaatga aatactttgt 5160
gtttgttcca atatggtagc caataatcaa attactcttt aagcactgga aatgttacca 5220
aggaactaat ttttatttga agtgtaactg tggacagagg agccataact gcagacttgt 5280
gggatacaga agaccaatgc agactttaat gtcttttctc ttacactaag caataaagaa 5340
ataaaaattg aacttctagt atcctatttg tttaaactgc tagctttact taacttttgt 5400
gcttcatcta tacaaagctg aaagctaagt ctgcagccat tactaaacat gaaagcaagt 5460
aatgataatt ttggatttca aaaatgtagg gccagagttt agccagccag tggtggtgct 5520
tgcctttatg cctttaatcc cagcactctg gaggcagaga caggcagatc tctgagtttg 5580
agcccagcct ggtctacaca tcaagttcta tctaggatag ccaggaatac acacagaaac 5640
cctgttgggg aggggggctc tgagatttca taaaattata attgaagcat tccctaatga 5700
gccactatgg atgtggctaa atccgtctac ctttctgatg agatttgggt attatttttt 5760 
ctgtctctgc tgttggttgg gtcttttgac actgtgggct ttctttaaag cctccttcct 5820 
gccatgtggt ctcttgtttg ctactaactt cccatggctt aaatggcatg gctttttgcc 5880 
ttctaagggc agctgctgag atttgcagcc tgatttccag ggtggggttg ggaaatcttt 5940 
caaacactaa aattgtcctt taattttttt tttaaaaaat gggttatata ataaacctca 6000
taaaatagtt atgaggagtg aggtggacta atattaaatg agtccctccc ctataaaaga 6060
gctattaagg ctttttgtct tatacttaac ttttttttta aatgtggtat ctttagaacc 6120
aagggtctta gagttttagt atacagaaac tgttgcatcg cttaatcaga ttttctagtt 6180
tcaaatccag agaatccaaa ttcttcacag ccaaagtcaa attaagaatt tctgactttt 6240 
aatgttaatt tgcttactgt gaatataaaa atgatagctt ttcctgaggc agggtctcac 6300 
tatgtatctc tgcctgatct gcaacaagat atgtagacta aagttctgcc tgcttttgtc 6360 
tcctgaatac taaggttaaa atgtagtaat acttttggaa cttgcaggtc agattctttt 6420
ataggggaca cactaaggga gcttgggtga tagttggtaa aatgtgtttc aagtgatgaa 6480
aacttgaatt attatcaccg caacctactt tttaaaaaaa aaagccaggc ctgttagagc 6540
atgcttaagg gatccctagg acttgctgag cacacaagag tagttacttg gcaggctcct 6600
ggtgagagca tatttcaaaa aacaaggcag acaaccaaga aactacagtt aaggttacct 6660
gtctttaaac catctgcata tacacaggga tattaaaata ttccaaataa tatttcattc 6720
aagttttccc ccatcaaatt gggacatgga tttctccggt gaataggcag agttggaaac 6780
taaacaaatg ttggttttgt gatttgtgaa attgttttca agtgatagtt aaagcccatg 6840
agatacagaa caaagctgct atttcgaggt ctcttggttt atactcagaa gcacttcttt 6900
gggtttccct gcactatcct gatcatgtgc taggcctacc ttaggctgat tgttgttcaa 6960
ataaacttaa gtttcctgtc aggtgatgtc atatgatttc atatatcaag gcaaaacatg 7020
ttatatatgt taaacatttg tacttaatgt gaaagttagg tctttgtggg tttgattttt 7080
aattttcaaa acctgagcta aataagtcat ttttacatgt cttacatttg gtggaattgt 7140
ataattgtgg tttgcaggca agactctctg acctagtaac cctacctata gagcactttg 7200
ctgggtcaca agtctaggag tcangcattt caccttgaag ttgagacgtt ttgttagtgt 7260
atactagttt atatgttgga ggacatgttt atccagaaga tattcaggac tatttttgac 7320 
tgggctaagg aattgattct gattagcact gttagtgagc attgagtggc ctttaggctt 7380 
gaattggagt cacttgtata tctcaaataa tgctggcctt ttttaaaaag cccttgttct 7440 
ttatcaccct gttttctaca taatttttgt tcaaagaaat acttgtttgg atctcctttt 7500
gacaacaata gcatgttttc aagccatatt ttttttcctt tttttttttt tttttggttt 7560
ttcgagacag ggtttctctg tatagccctg gctgtcctgg aactcacttt gtagaccagg 7620
ctggcctcga actcagaaat ccgcctgcct ctgcctcctg agtgccggga ttaaaggcgt 7680
gcaccaccac gcctggctaa gttggatatt ttgttatata actataacca atactaactc 7740
cactgggtgg atttttaatt cagtcagtag tcttaagtgg tctttattgg cccttcatta 7800
aaatctactg ttcactctaa cagaggctgt tggtactagt ggcacttaag caacttccta 7860 
cggatatact agcagattaa gggtcaggga tagaaactag tctagcgttt tgtataccta 7920 
ccagctttat actaccttgt tctgatagaa atatttcagg acatctagag tgtactataa 7980 
ggttgatggt aagcttataa ggaacttgaa agtggagtaa ctactccatt tctctgaggg 8040 
gagaattaaa atttttgacc aagtgttgtt gagccactga gaatggtctc agaacataac 8100 

ttcttaagga accttcccag attgccctca acactgcacc acatttggtc ctgcttgaac 8160
attgccatgg ctcttaaagt cttaattaag aatattaatt gtgtaattat tgtttttcct 8220
cctttagatc attccttgag gacaggacag tgcttgttta aggctatatt tctgctgtct 8280
gagcagcaac aggtcttcga gatcaacatg atgttcataa tcccaagatg ttgccattta 8340
tgttctcaga agcaagcaga ggcatgatgg tcagtgacag taatgtcact gtgttaaatg 8400
ttgctatgca gtttggattt ttctaatgta gtgtaggtag aacatatgtg ttctgtatga 8460
attaaactct taagttacac cttgtataat ccatgcaatg tgttatgcaa ttaccatttt 8520
aagtattgta gctttctttg tatgtgagga taaaggtgtt tgtcataaaa tgttttgaac 8580
atttccccaa agttccaaat tataaaacca caacgttaga acttatttat gaacaatggt 8640
tgtagtttca tgcttttaaa atgcttaatt attcaattaa caccgtttgt gttataatat 8700
atataaaact gacatgtaga agtgtttgtc cagaacattt cttaaatgta tactgtcttt 8760
agagagttta atatagcatg tcttttgcaa catactaact tttgtgttgg tgcgagcaat 8820
attgtgtagt cattttgaaa ggagtcattt caatgagtgt cagattgttt tgaatgttat 8880
tgaacatttt aaatgcagac ttgttcgtgt tttagaaagc aaaactgtca gaagctttga 8940
actagaaatt aaaaagctga agtatttcag aagggaaata agctacttgc tgtattagtt 9000
gaaggaaagt gtaatagctt agaaaattta aaaccatata gttgtcattg ctgaatatct 9060
ggcagatgaa aagaaatact cagtggttct tttgagcaat ataacagctt gttatattaa 9120
aaattttccc cacagatata aactctaatc tataactcat aaatgttaca aatggatgaa 9180
gcttacaaat gtggcttgac ttgtcactgt gcttgtttta gttatgtgaa agtttggcaa 9240
taaacctatg tcctaaatag tcaaactgtg gaatgacttt ttaatctatt ggtttgtcta 9300
gaacagttat gttgccattt gccctaatgg tgaaagaaaa agtggggagt gccttggcac 9360
tgttcatttg tggtgtgaac caaagagggg ggcatgcact tacacttcaa acatcctttt 9420
gaaagactga caagtttggg tcttcacagt tggaattggg catccctttt gtcagggagg 9480
gagggaggga gggaggctgg cttgttatgc tgacaagtgt gattaaattc aaactttgag 9540
gtaagttgga ggaacttgta cattgttagg agtgtgacaa tttggactct taatgatttg 9600
gtcatacaaa atgaacctag accaacttct ggaagatgta tataataact ccatgttaca 9660
ttgatttcac ctgactaata cttatccctt atcaattaaa tacagaagat gccagccatc 9720
tgggcctttt aacccagaaa tttagtttca aactcctagg ttagtgttct cactgagcta 9780
catcctgatc tagtcctgaa aataggacca ccatcacccc caaaaaaatc tcaaataaga 9840
tttatgctag tgtttcaaaa ttttaggaat aggtaagatt agaaagtttt aaattttgag 9900
aaatggcttc tctagaaaga tgtacatagt gaacactgaa tggctcctaa agagcctaga 9960
aaactggtac tgagcacaca ggactgagag gtctttcttg aaaagcatgt attgctttac 10020
gtgggtcaca gaaggcaggc aggaagaact tgggctgaaa ctggtgtctt aagtggctaa 10080
catcttcaca actgatgagc aagaacttta tcctgatgca aaaaccatcc aaacaaacta 10140
agtgaaaggt ggcaatggat cccaggctgc tctagaggag gacttgactt ctcatcccat 10200
cacccacacc agatagctca tagactgcca attaacacca gcttctagcc tccacaggca 10260
cctgcactgg tacacataat ttcacacaaa cacagtaaga agccttccac ctggcatggt 10320
attgcttatc tttagttccc aacacttggg aggcagaggc cagccagggc tatgtgacaa 10380
aaaccttgtc tagaggagaa acttcatagc ttatttccta ttcacgtaac caggttagca 10440
aaatttacca gccagagatg aagctaacag tgtccactat atttgtagtg ttttaagtca 10500
attttttaaa tatacttaat agaattaaag ctatggtgaa ccaagtacaa acctggtgta 10560
ttaacttgag aacttagcat aaaaagtagt tcatttgttc agtaaatatt aaatgcttac 10620
tggcaaagat tatgtcagga acttggtaaa tggtgatgaa acaatcatag ttgtacatct 10680
tggttctgtg atcaccttgg tttgaggtaa aagtggttcc tttgatcaag gatggaattt 10740
taagtttata ttcaatcaat aatgtattat tttgtgattg caaaattgcc tatctagggt 10800
ataaaacctt taaaaatttc ataataccag ttcattctcc agttactaat tccaaaaagc 10860
cactgactat ggtgccaatg tggattctgt tctcaaagga aggattgtct gtgcccttta 10920
ttctaataga aacatcacac tgaaaatcta agctgaaaga agccagactt tcctaaataa 10980
ataactttcc ataaagctca aacaaggatt acttttagga ggcactgtta aggaactgat 11040
aagtaatgag gttacttata taatgatagt cccacaagac tatctgagga aaaatcagta 11100
caactcgaaa acagaacaac cagctaggca ggaataacag ggctcccaag tcaggaggtc 11160
tatccaacac ccttttctgt tgagggcccc agacctacat attgtataca aacagggagg 11220
tgggtgattt taactctcct gaggtacctt ggtaaatctt tgtcctgagt aagcagtaca 11280
gtgtacagtt tacattttca tttaaagata cattagctcc ctctaccccc taagactgac 11340
aggcactttg ggggtgggga gggctttgga aaataacgct tccatacact aaaagagaaa 11400
tttctttaat taggcttgtt ggttccatac atctactggt gtttctacta cttagtaata 11460
ttataatagt cacacaagca tctttgctct gtttaggttg tatatttatt ttaaggcaga 11520
tgataaaact gtagatctta agggatgctt ctgcttctga gatgatacaa agaatttaga 11580
ccataaaaca gtaggttgca caagcaatag aatatggcct aaagtgttct gacacttaga 11640
agccaagcag tgtaggcttc ttaagaaata ccattacaat caccttgcta gaaatcaagc 11700
attctggagt ggtcaagcag tgtaacctgt actgtaagtt acttttctgc tatttttctc 11760
ccaaagcaag ttctttatgc tgatatttcc agtgttagga actacaaata ttaataagtt 11820
gtcttcactc ttttctttac caaggagggt ctcttccttc atcttgatct gaaggatgaa 11880
caaaggcttg agcagtgcgc tttagaagat aaactgcagc atgaaggccc ccgatgttca 11940
cccagactac atggaccttt cgccacacat gtcccattcc agataaggcc tggcacacac 12000
aaaaaacata agtcattagg ctaccagtct gattctaaaa caacctaaaa tcttcccact 12060
taaatgctat gggtggtggg ttggaaagtt gactcagaaa atcacttgct gtttttagag 12120
aggatctggg ttcagtttct gatacattgt ggcttacaac tataactcca gttctagggg 12180
gtccatccaa catcctcttc tgttgagggc accaaataaa tgtattgtgt acaaacaggg 12240
aggtgagtga tttaactctc gtgtatagta ccttggtaaa acatttcttg tcctgagtaa 12300
gcagtacagc tctgcctgtc cctggtctac agacacggct catttcccga aggcaagctg 12360
gatagagatt ccaatttctc ttcttggatc ccatcctata aaagaaggtc aagtttaatc 12420
tattgcaaaa ggtaaatagg tagtttctta catgagacaa gaacaaatct taggtgtgaa 12480
gcagtcatct tttacaggcc agagcctcta ttctatgcca atgaaggaaa ctgttagtcc 12540
agtgttatag agttagtcca gtgtatagtt ttctatcaga acactttttt tttaaacaac 12600
tgcaacttag cttattgaag acaaaccacg agtagaaatc tgtccaagaa gcaagtgctt 12660
ctcagcctac aatgtggaat aggaccatgt aatggtacag tgagtgaaat gaattatggc 12720
atgtttttct gactgagaag acagtacaat aaaaggtaaa ctcatggtat ttatttaaaa 12780
agaatccaat ttctaccttt ttccaaatgg catatctgtt acaataatat ccacagaagc 12840
agttctcagt gggaggttgc agatatccca ctgaacagca tcaatgggca aaccccaggt 12900
tgtttttctg tggagacaaa ggtaagatat ttcaatatat tttcccaagc taatgagatg 12960
gctcagcaaa taatggtact ggccattaag tctcatgacc tgagcttgat cctcagggac 13020
catgtggtac aaggagagac ctaaatcctt cagttggact tcaatcttct accctcatgt 13080
ccacacacaa ataaatacaa taaaaaacat tctgcagtcg aatttctaaa agggcgaat 13139



<210> 4
<211> 1073
<212> DNA
<213> Mus musculus

<220>
<223> Description: 5' arm of Rosa 26

<400> 4
caggccctcc gagcgtggtg gagccgttct gtgagacagc cgggtacgag tcgtgacgct 60
ggaaggggca agcgggtggt gggcaggaat gcggtccgcc ctgcagcaac cggaggggga 120
gggagaaggg agcggaaaag tctccaccgg acgcggccat ggctcggggg ggggggggca 180
gcggaggagc gcttccggcc gacgtctcgt cgctgattgg cttcttttcc tcccgccgtg 240
tgtgaaaaca caaatggcgt gttttggttg gcgtaaggcg cctgtcagtt aacggcagcc 300
ggagtgcgca gccgccggca gcctcgctct gcccactggg tggggcggga ggtaggtggg 360
gtgaggcgag ctggacgtgc gggcgcggtc ggcctctggc ggggcggggg aggggaggga 420
gggtcagcga aagtagctcg cgcgcgagcg gccgcccacc ctccccttcc tctgggggag 480
tcgttttacc cgccgccggc cgggcctcgt cgtctgattg gctctcgggg cccagaaaac 540
tggcccttgc cattggctcg tgttcgtgca agttgagtcc atccgccggc cagcgggggc 600
ggcgaggagg cgctcccagg ttccggccct cccctcggcc ccgcgccgca gagtctggcc 660
gcgcgcccct gcgcaacgtg gcaggaagcg cgcgctgggg gcggggacgg gcagtagggc 720
tgagcggctg cggggcgggt gcaagcacgt ttccgacttg agttgcctca agaggggcgt 780
gctgagccag acctccatcg cgcactccgg ggagtggagg gaaggagcga gggctcagtt 840
gggctgtttt ggaggcagga agcacttgct ctcccaaagt cgctctgagt tgttatcagt 900
aagggagctg cagtggagta ggcggggaga aggccgcacc cttctccgga ggggggaggg 960
gagtgttgca atacctttct gggagttctc tgctgcctcc tggcttctga ggaccgccct 1020
gggcctggga gaatcccttc cccctcttcc ctcgtgatct gcaactccag tct 1073



<210> 5
<211> 4333
<212> DNA
<213> Mus musculus

<220>
<223> Description: 3' arm of Rosa 26

<400> 5
tagaagatgg gcgggagtct tctgggcagg cttaaaggct aacctggtgt gtgggcgttg 60
tcctgcaggg gaattgaaca ggtgtaaaat tggagggaca agacttccca cagattttcg 120
gttttgtcgg gaagtttttt aataggggca aataaggaaa atgggaggat aggtagtcat 180
ctggggtttt atgcagcaaa actacaggtt attattgctt gtgatccgcc tcggagtatt 240
ttccatcgag gtagattaaa gacatgctca cccgagtttt atactctcct gcttgagatc 300
cttactacag tatgaaatta cagtgtcgcg agttagacta tgtaagcaga attttaatca 360
tttttaaaga gcccagtact tcatatccat ttctcccgct ccttctgcag ccttatcaaa 420
aggtatttta gaacactcat tttagcccca ttttcattta ttatactggc ttatccaacc 480
cctagacaga gcattggcat tttccctttc ctgatcttag aagtctgatg actcatgaaa 540
ccagacagat tagttacata caccacaaat cgaggctgta gctggggcct caacactgca 600
gttcttttat aactccttag tacacttttt gttgatcctt tgccttgatc cttaattttc 660
agtgtctatc acctctcccg tcagtggtgt tccacatttg ggcctattct cagtccaggg 720
agttttacaa caatagatgt attgagaatc caacctaaag cttaactttc cactcccatg 780
aatgcctctc tcctttttct ccatttataa actgagctat taaccattaa tggttccagg 840
tggatgtctc ctccccatat tacctgatgt atcttacata ttgccaggct gatattttaa 900
gacattaaaa ggtatatttc attattgagc cacatggtat tgattactgc ttactaaaat 960
tttgtcattg tacacatctg taaaaggtgg ttccttttgg aatgcaaagt tcaggtgttt 1020
gttgtctttc ctgacctaag gtcttgtgag cttgtatttt ttctatttaa gcagtgcttt 1080
ctcttggact ggcttgactc atggcattct acacgttatt gctggtctaa atgtgatttt 1140
gccaagcttc ttcaggacct ataattttgc ttgacttgta gccaaacaca agtaaaatga 1200

ttaagcaaca aatgtatttg tgaagcttgg tttttaggtt gttgtgttgt gtgtgcttgt 1260 

gctctataat aatactatcc aggggctgga gaggtggctc ggagttcaag agcacagact 1320 

gctcttccag aagtcctgag ttcaattccc agcaaccaca tggtggctca caaccatctg 1380 

taatgggatc tgatgccctc ttctggtgtg tctgaagacc acaagtgtat tcacattaaa 1440 

taaataaatc ctccttcttc ttcttttttt tttttttaaa gagaatactg tctccagtag 1500

aatttactga agtaatgaaa tactttgtgt ttgttccaat atggtagcca ataatcaaat 1560

tactctttaa gcactggaaa tgttaccaag gaactaattt ttatttgaag tgtaactgtg 1620 

gacagaggag ccataactgc agacttgtgg gatacagaag accaatgcag actttaatgt 1680

cttttctctt acactaagca ataaagaaat aaaaattgaa cttctagtat cctatttgtt 1740 

taaactgcta gctttactta acttttgtgc ttcatctata caaagctgaa agctaagtct 1800

gcagccatta ctaaacatga aagcaagtaa tgataatttt ggatttcaaa aatgtagggc 1860 

cagagtttag ccagccagtg gtggtgcttg cctttatgcc tttaatccca gcactctgga 1920 

ggcagagaca ggcagatctc tgagtttgag cccagcctgg tctacacatc aagttctatc 1980 

taggatagcc aggaatacac acagaaaccc tgttggggag gggggctctg agatttcata 2040 

aaattataat tgaagcattc cctaatgagc cactatggat gtggctaaat ccgtctacct 2100 

ttctgatgag atttgggtat tattttttct gtctctgctg ttggttgggt cttttgacac 2160 

tgtgggcttt ctttaaagcc tccttcctgc catgtggtct cttgtttgct actaacttcc 2220 

catggcttaa atggcatggc tttttgcctt ctaagggcag ctgctgagat ttgcagcctg 2280 

atttccaggg tggggttggg aaatctttca aacactaaaa ttgtccttta attttttttt 2340 

taaaaaatgg gttatataat aaacctcata aaatagttat gaggagtgag gtggactaat 2400 

attaaatgag tccctcccct ataaaagagc tattaaggct ttttgtctta tacttaactt 2460

tttttttaaa tgtggtatct ttagaaccaa gggtcttaga gttttagtat acagaaactg 2520 

ttgcatcgct taatcagatt ttctagtttc aaatccagag aatccaaatt cttcacagcc 2580

aaagtcaaat taagaatttc tgacttttaa tgttaatttg cttactgtga atataaaaat 2640 

gatagctttt cctgaggcag ggtctcacta tgtatctctg cctgatctgc aacaagatat 2700 

gtagactaaa gttctgcctg cttttgtctc ctgaatacta aggttaaaat gtagtaatac 2760 
ttttggaact tgcaggtcag attcttttat aggggacaca ctaagggagc ttgggtgata 2820

gttggtaaaa tgtgtttcaa gtgatgaaaa cttgaattat tatcaccgca acctactttt 2880 

taaaaaaaaa agccaggcct gttagagcat gcttaaggga tccctaggac ttgctgagca 2940

cacaagagta gttacttggc aggctcctgg tgagagcata tttcaaaaaa caaggcagac 3000

aaccaagaaa ctacagttaa ggttacctgt ctttaaacca tctgcatata cacagggata 3060

ttaaaatatt ccaaataata tttcattcaa gttttccccc atcaaattgg gacatggatt 3120 

tctccggtga ataggcagag ttggaaacta aacaaatgtt ggttttgtga tttgtgaaat 3180 

tgttttcaag tgatagttaa agcccatgag atacagaaca aagctgctat ttcgaggtct 3240 

cttggtttat actcagaagc acttctttgg gtttccctgc actatcctga tcatgtgcta 3300 

ggcctacctt aggctgattg ttgttcaaat aaacttaagt ttcctgtcag gtgatgtcat 3360 

atgatttcat atatcaaggc aaaacatgtt atatatgtta aacatttgta cttaatgtga 3420 

aagttaggtc tttgtgggtt tgatttttaa ttttcaaaac ctgagctaaa taagtcattt 3480 

ttacatgtct tacatttggt ggaattgtat aattgtggtt tgcaggcaag actctctgac 3540

ctagtaaccc tacctataga gcactttgct gggtcacaag tctaggagtc aagcatttca 3600 

ccttgaagtt gagacgtttt gttagtgtat actagtttat atgttggagg acatgtttat 3660 

ccagaagata ttcaggacta tttttgactg ggctaaggaa ttgattctga ttagcactgt 3720 

tagtgagcat tgagtggcct ttaggcttga attggagtca cttgtatatc tcaaataatg 3780

ctggcctttt ttaaaaagcc cttgttcttt atcaccctgt tttctacata atttttgttc 3840 

aaagaaatac ttgtttggat ctccttttga caacaatagc atgttttcaa gccatatttt 3900 

ttttcctttt tttttttttt tttggttttt cgagacaggg tttctctgta tagccctggc 3960 

tgtcctggaa ctcactttgt agaccaggct ggcctcgaac tcagaaatcc gcctgcctct 4020 

gcctcctgag tgccgggatt aaaggcgtgc accaccacgc ctggctaagt tggatatttt 4080 

gttatataac tataaccaat actaactcca ctgggtggat ttttaattca gtcagtagtc 4140 
ttaagtggtc tttattggcc cttcattaaa atctactgtt cactctaaca gaggctgttg 4200 

gtactagtgg cacttaagca acttcctacg gatatactag cagattaagg gtcagggata 4260 

gaaactagtc tagcgttttg tatacctacc agctttatac taccttgttc tgatagaaat 4320 

atttcaggac atc 4333



<210> 6
<211> 13544
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Rosa 26
targeting vector pOMP3 with zsgreen

<400> 6
ctagggataa cagggtaata tagccgcggc aggccctccg agcgtggtgg agccgttctg 60
tgagacagcc gggtacgagt cgtgacgctg gaaggggcaa gcgggtggtg ggcaggaatg 120
cggtccgccc tgcagcaacc ggagggggag ggagaaggga gcggaaaagt ctccaccgga 180
cgcggccatg gctcgggggg gggggggcag cggaggagcg cttccggccg acgtctcgtc 240
gctgattggc ttcttttcct cccgccgtgt gtgaaaacac aaatggcgtg ttttggttgg 300
cgtaaggcgc ctgtcagtta acggcagccg gagtgcgcag ccgccggcag cctcgctctg 360
cccactgggt ggggcgggag gtaggtgggg tgaggcgagc tggacgtgcg ggcgcggtcg 420
gcctctggcg gggcggggga ggggagggag ggtcagcgaa agtagctcgc gcgcgagcgg 480
ccgcccaccc tccccttcct ctgggggagt cgttttaccc gccgccggcc gggcctcgtc 540
gtctgattgg ctctcggggc ccagaaaact ggcccttgcc attggctcgt gttcgtgcaa 600
gttgagtcca tccgccggcc agcgggggcg gcgaggaggc gctcccaggt tccggccctc 660
ccctcggccc cgcgccgcag agtctggccg cgcgcccctg cgcaacgtgg caggaagcgc 720
gcgctggggg cggggacggg cagtagggct gagcggctgc ggggcgggtg caagcacgtt 780
tccgacttga gttgcctcaa gaggggcgtg ctgagccaga cctccatcgc gcactccggg 840
gagtggaggg aaggagcgag ggctcagttg ggctgttttg gaggcaggaa gcacttgctc 900
tcccaaagtc gctctgagtt gttatcagta agggagctgc agtggagtag gcggggagaa 960
ggccgcaccc ttctccggag gggggagggg agtgttgcaa tacctttctg ggagttctct 1020
gctgcctcct ggcttctgag gaccgccctg ggcctgggag aatcccttcc ccctcttccc 1080
tcgtgatctg caactccagt ctttctaggt aaccgatatc cctgcagggg tgacctgcac 1140
gtctagggcg cagtagtcca gggtttcctt gatgatgtca tacttatcct gtcccttttt 1200
tttccacagc tcgcggttga ggacaaactc ttcgcggtct ttccagtact cctgcaggtg 1260
actgactgag tcgactaggg gtaaccataa cttcgtataa tgtatgctat acgaagttat 1320
ttataatcfca gaactagtgg atccacgatt cgagggcccc tgcaggtcaa ttctaccggg 1380
taggggaggc gcttttccca aggcagtctg gagcatgcgc tttagcagcc ccgctgggca 1440
cttggcgcta cacaagtggc ctctggcctc gcacacattc cacatccacc ggtaggcgcc 1500
aaccggctcc gttctttggt ggccccttcg cgccaccttc tactcctccc ctagtcagga 1560
agttcccccc cgccccgcag ctcgcgtcgt gcaggacgtg acaaatggaa gtagcacgtc 1620
tcactagtct cgtgcagatg gacagcaccg ctgagcaatg gaagcgggta ggcctttggg 1680
gcagcggcca atagcagctt tgctccttcg ctttctgggc tcagaggctg ggaaggggtg 1740
ggtccggggg cgggctcagg gcgggctcag gggcggggcg ggcgcccgaa ggtcctccgg 1800
aggcccggca ttctgcacgc ttcaaaagcg cacgtctgcc gcgctgttct cctcttcctc 1860
atctccgggc ctttcgacct gcagccaata tgggatcggc cattgaacaa gatggattgc 1920
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 1980
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 2040
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 2100
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 2160
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 2220
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 2280
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 2340
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 2400
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 2460
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 2520
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 2580
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 2640
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgaggggatc 2700
gatccgctgt aagtctgcag aaattgatga tctattaaac aataaagatg tccactaaaa 2760
tggaagtttt tcctgtcata ctttgttaag aagggtgaga acagagtacc tacattttga 2820
atggaaggat tggagctacg ggggtggggg tggggtggga ttagataaat gcctgctctt 2880
tactgaaggc tctttactat tgctttatga taatgtttca tagttggata tcataattta 2940
aacaagcaaa accaaattaa gggccagctc attcctccca ctcatgatct atagatctat 3000
agatctctcg tgggatcatt gtttttctct tgattcccac tttgtggttc taagtactgt 3060
ggtttccaaa tgtgtcagtt tcatagcctg aagaacgaga tcagcagcct ctgttccaca 3120
tacacttcat tctcagtatt gttttgccaa gttctaattc catcagaagc tgactctaga 3180
tcctgcagga attcatatga taacttcgta taatgtatgc tatacgaagt tatggcgcgc 3240
cggtaaccga agttcctata ctttctagag aataggaact tcggaatagg aacttcaagc 3300
ttaagcgcta gaagatgggc gggagtcttc tgggcaggct taaaggctaa cctggtgtgt 3360
gggcgttgtc ctgcagggga attgaacagg tgtaaaattg gagggacaag acttcccaca 3420
gattttcggt tttgtcggga agttttttaa taggggcaaa taaggaaaat gggaggatag 3480
gtagtcatct ggggttttat gcagcaaaac tacaggttat tattgcttgt gatccgcctc 3540
ggagtatttt ccatcgaggt agattaaaga catgctcacc cgagttttat actctcctgc 3600
ttgagatcct tactacagta tgaaattaca gtgtcgcgag ttagactatg taagcagaat 3660
tttaatcatt tttaaagagc ccagtacttc atatccattt ctcccgctcc ttctgcagcc 3720
ttatcaaaag gtattttaga acactcattt tagccccatt ttcatttatt atactggctt 3780
atccaacccc tagacagagc attggcattt tccctttcct gatcttagaa gtctgatgac 3840
tcatgaaacc agacagatta gttacataca ccacaaatcg aggctgtagc tggggcctca 3900
acactgcagt tcttttataa ctccttagta cactttttgt tgatcctttg ccttgatcct 3960
taattttcag tgtctatcac ctctcccgtc agtggtgttc cacatttggg cctattctca 4020
gtccagggag ttttacaaca atagatgtat tgagaatcca acctaaagct taactttcca 4080
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ctcccatgaa tgcctctctc ctttttctcc 
gttccaggtg gatgtctcct ccccatatta 
tattttaaga cattaaaagg. tatatttcat 
actaaaattt tgtcattgta cacatctgta 
aggtgtttgt tgtctttcct gacctaaggt 
agtgctttct cttggactgg cttgactcat 
gtgattttgc caagcttctt caggacctat 
taaaatgatt aagcaacaaa tgtatttgtg 
gtgcttgtgc tctataataa tactatccag 
cacagactgc tcttccagaa gtcctgagtt 
accatctgta atgggatctg atgccctctt. 
acattaaata aataaatcct ccttcttctt 
tccagtagaa tttactgaag taatgaaata 
aatcaaatta ctctttaagc actggaaatg 
taactgtgga cagaggagcc ataactgcag 
tttaatgtct tttctcttac actaagcaat 
tatttgttta aactgctagc tttacttaac 
ctaagtctgc agccattact aaacatgaaa 
tgtagggcca gagtttagcc agccagtggt 
actctggagg cagagacagg cagatctctg 
gttctatcta ggatagccag gaatacacac 
atttcataaa attataattg aagcattccc 
gtctaccttt ctgatgagat ttgggtatta 
tttgacactg tgggctttct ttaaagcctc 
taacttccca tggcttaaat ggcatggctt 
gcagcctgat ttccagggtg gggttgggaa 
ttttttttta aaaaatgggt tatataataa 
ggactaataf taaatgagtc cctcccctat 
cttaactttt tttttaaatg tggtatcttt 
agaaactgtt gcatcgctta atcagatttt 
tcacagccaa agtcaaatta agaatttctg 
.ataaaaatga tagcttttcc .tgaggcaggg 
caagatatgt agactaaagt tctgcctgct 
agtaatactt ttggaacttg caggtcagat 
gggtgatagt tggtaaaatg tgtttcaagt 
■ctacttttta aaaaaaaaag ccaggcctgt 
gctgagcaca caagagtagt tacttggcag 
aggcagacaa ccaagaaact acagttaagg 
cagggatatt aaaatattcc aaataatatt 
catggatttc tccggtgaat aggcagagtt 
tgtgaaattg ttttcaagtg atagttaaag 
cgaggtctct tggt'ttatac tcagaagcac 
atgtgctagg cctaccttag gctgattgtt 
gatgtcatat gatttcatat . atcaaggcaa . 
taatgtgaaa gttaggtctt tgtgggtttg 
agtcattttt acatgtctta catttggtgg 
tctctgacct agtaacccta cctatagagc 
gcatttcacc ttgaagttga gacgttttgt 
atgtttatcc agaagatatt caggactatt 
agcactgtta gtgagcattg agtggccttt 
aaataatgct ggcctttttt aaaaagccct 
ttttgttcaa agaaatactt gtttggatct 
catatttttt ttcctttttt tttttttttt 
gccctggctg tcctggaact cactttgtag 
ctgcctctgc ctcctgagtg ccgggattaa 
gatattttgt tatataacta taaccaatac 
cagtagtctt aagtggtctt tattggccct 
ggctgttggt actagtggca cttaagcaac 
cagggataga aactagtcta gcgttttgta 
atagaaatat ttcaggacat ctagcacggg 
ttggacaaac cacaactaga atgcagtgaa 
ctattgcttt atttgtaac.c attataagct 
ttcattttat gtttcaggtt cagggggagg 
tctacaaatg tggtatggct gattatgatc 
gccggaggcg atggcgtgct cggtcaggtg 
ctcgcgggtc agcttgtgct ggatgaagtg 
ggccttgtac acggtgtcga actggcagcg 



atttataaac tgagctatta accattaatg 4140 
cctgatgtat cttacatatt gccaggctga 4200 
tattgagcca catggtattg attactgctt 4260 
aaaggtggtt ccttttggaa tgcaaagttc 4320 
cttgtgagct tgtatttttt ctatttaagc 4380 
ggcattctac acgttattgc tggtctaaat 4440 
aattt.tgctt gacttgtagc caaacacaag 4 500 
aagcttggtt tttaggttgt tgtgttgtgt 4560 
gggctggaga ggtggctcgg agttcaagag 4 620 
caattcccag caaccacatg gtggctcaca 4 680- 
ctggtgtgtc tgaagaccac aagtgtattc 474 0 
cttttttttt tt-tttaaaga gaatactgtc 4 800 
ctttgtgttt gttccaatat ggtagccaat 4860 
ttaccaagga actaattttt atttgaagtg 4920 
acttgtggga tacagaagac caatgcagac 4980 
aaagaaataa aaattgaact tctagtatcc 5040 
ttttgtgctt catctataca aagctgaaag' 5100 
gcaagtaatg ataattttgg atttcaaaaa 5160 
ggtgcttgcc tttatgcctt taatcccagc 5220 
agtttgagcc cagcctggtc tacacatcaa 5280 
agaaaccctg ttggggaggg gggctctgag 5340 
taatgagcca ctatggatgt ■ ggcta.aatcc 5400 
ttttttctgt ctctgctgtt ggttgggtct 54 60 
cttcctgcca .tgtggtctct • tgtttgctac 5520 
tttgccttct aagggcagct gctgagattt 5580 
atctttcaaa cactaaaatt gtcctttaat 5640 
acctcataaa atagttatga ggagtgaggt 5700 
aaaagagcta ttaaggcttt tt'gtcttata 57 60 
agaaccaagg gtcttagagt tttagtatac 5820 
ctagtttcaa atccagagaa tccaaattct 5880 
acttttaatg ttaatttgct tactgtgaat 5940 
tctcactatg tatctctgcc tgatctgcaa 6000 
tttgtctcct gaatactaag gttaaaatgt 6060 
tcttttatag gggacacact aagggagctt 6120 
gatgaaaact tgaattatta tcaccgcaac 6180 
tagagcatgc ttaagggatc cctaggactt 6240 
gctcctggtg agagcatatt tcaaaaaaca 6300 
ttacctgtct ttaaaccatc tgcatataca 6360 
tcattcaagt tttcccccat caaattggga 6420 
ggaaactaaa caaatgttgg ttttgtgatt 6480
cccatgagat acagaacaaa gctgctattt 6540
ttctttgggt ttccctgcac tatcctgatc 6600
gttcaaataa acttaagttt cctgtcaggt 6660
aacatgttat atatgttaaa catttgtact 6720 
atttttaatt ttcaaaacct gagctaaata 6780 
aattgtataa ttgtggtttg caggcaagac 6840 
actttgctgg gtcacaagtc taggagtcaa 6900 
tagtgtatac tagtttatat gttggaggac 6960 
tttgactggg ctaaggaatt gattctgatt 7020 
aggcttgaat tggagtcact tgtatatctc 7080 
tgttctttat caccctgttt tctacataat 7140 
ccttttgaca acaatagcat gttttcaagc 7200 
tggtttttcg agacagggtt tctctgtata 7260 
accaggctgg cctcgaactc agaaatccgc 7320 
aggcgtgcac caccacgcct ggctaagttg 7380 
taactccact gggtggattt ttaattcagt 7440 
tcattaaaat ctactgttca ctctaacaga 7500 
ttcctacgga tatactagca gattaagggt 7560
tacctaccag ctttatacta ccttgttctg 7620 
caccggtgtt tttaagatac attgatgagt 7680 
aaaaatgctt tatttgtgaa atttgtgatg 7740 
gcaataaaca agttaacaac aacaattgca 7800 
tgtgggaggt tttttaaagc aagtaaaacc 7860 
tagagtcgcg gccgctcagg gcaaggcgga 7920
ccacttctgg ttcttggcgt cgctgcggtc 7980 
ccagtcgggc atcttgcggg gcacggactt 8040 
caagcggcca ccgtccttca gcagcaggta 8100 
catgctcacg tcgcccttca agatgccctg cttgggcacg gggatgatct tctcgcagga 8160
gggctcccag ttgtcggtca tcttcttcat cacggggccg tcggcgggga agttcacgcc 8220
gtagaacttg gactcgtggt acatgcagtt ctcctccacg ctcacggtga tgtcggcgtt 8280
gcagatgcac acggcgccgt cctcgaacag gaaggagcgg tcccaggtgt agccggcggg 8340
gcaggagttc ttgaagtagt cgacgatgtc ctgggggtac tcggtgaaca cgcggttgcc 8400
gtacatgaag gcggcggaca agatgtcctc ggcgaagggc aaggggccgc cctccaccac 8460
gcacaggttg atggcctgct tgcccttgaa ggggtagccg atgccctcgc cggtgatcac 8520
gaacttgtgg ccgtccacgc agccctccat gcggtacttc atggtcatct ccttggtcag 8580
gccgtgcttg gactgggcca tggtggcgac cggtggatcc actagttcta gctgtaggaa 8640
aaagaagaag gcatgaacat ggttagcaga ggctctaggg ccgccctggg agtggacacc 8700
tgtggagaga aaggcaaagt ggatgtcaat gtcactcaag tgtatggcca gatctcaagc 8760
ctgccacacc tcaagcttga caacaaaaag attgtctttt ctgaccagat ggacgcggcc 8820
accctcaaag gcatcaccgc gggccaggtg aatatcaaat cctcctcgtt tttggaaact 8880
gacaatctta gcgcagaagt catgcccgct tttgagaggg agtactcacc ccaacgctta 8940
ctagagccgc cggtcacacg ccagaagccg aaccccgccc tgccccgtcc cccccgaagg 9000
cagccgtccc cctgcggcag ccccgaggct ggagatggag aaggggacgg cggcgcggcg 9060
acgcacgaag gccctccccg cccatttcct tcctgccggc gccgcaccgc ttcgcccgcg 9120
cccgctagag ggggtgcggc ggcgcctccc agatttcggc tccgccagat ttgggacaaa 9180
ggaagtccct gcgccctctc gcacgattac cataaaaggc aatggctgcg gctcgccgcg 9240
cctcgacagc cgccggcgct ccggggccgc cgcgcccctc ccccgagccc tccccggccc 9300
gaggcggccc cgccccgccc ggcaccccca cctgccgcca ccccccgccc ggcacggcga 9360
gccccgcgcc acgccccgca cggagccccg cacccgaagc cgggccgtgc tcagcaactc 9420
ggggaggggg gtgcaggggg gggttacagc ccgaccgccg cgcccacacc ccctgctcac 9480
ccccccacgc acacaccccg cacgcagcct ttgttcccct cgcagccccc ccgcaccgcg 9540
gggcaccgcc cccggccgcg ctcccctcgc gcacacgcgg agcgcacaaa gccccgcgcc 9600
gcgcccgcag cgctcacagc cgccgggcag cgcgggccgc acgcggcgct ccccacgcac 9660
acacacacgc acgcaccccc cgagccgctc ccccccgcac aaagggccct cccggagccc 9720
tttaaggctt tcacgcagcc acagaaaaga aacgagccgt cattaaacca agcgctaatt 9780
acagcccgga ggagaagggc cgtcccgccc gctcacctgt gggagtaacg cggtcagtca 9840
gagccggggc gggcggcgcg aggcggcgcg gagcggggca cggggcgaag gcaacgcagc 9900
gactcccgcc cgccgcgcgc ttcgcttttt atagggccgc cgccgccgcc gcctcgccat 9960
aaaaggaaac tttcggagcg cgccgctctg attggctgcc gccgcacctc tccgcctcgc 10020
cccgccccgc ccctcgcccc gccccgcccc gcctggcgcg cgcccccccc ccccccgccc 10080
ccatcgctgc acaaaataat taaaaaataa ataaatacaa aattgggggt ggggaggggg 10140
gggagatggg gagagtgaag cagaacgtgg ggctcacctc gacccatggt aatagcgatg 10200
actaatacgt agatgtactg ccaagtagga aagtcccata aggtcatgta ctgggcataa 10260
tgccaggcgg gccatttacc gtcattgacg tcaatagggg gcgtacttgg catatgatac 10320
acttgatgta ctgccaagtg ggcagtttac cgtaaatagt ccacccattg acgtcaatgg 10380
aaagtcccta ttggcgttac tatgggaaca tacgtcatta ttgacgtcaa tgggcggggg 10440
tcgttgggcg gtcagccagg cgggccattt accgtaagtt atgtaacgcg gaactccata 10500
tatgggctat gaactaatga ccccgtaatt gattactatt aataactagt caataatcaa 10560
tgtcgaaaac ctgcaggcat gcaagctagc ccaaacctat attaccctgc cctagcgtaa 10620
ctatttaaat ggccggcccc tgcaggccgc ggtggagctc caattcgccc tatagtgagt 10680
cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 10740
cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 10800
cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 10860
gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 10920
ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 10980
gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 11040
ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 11100
gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 11160
tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 11220
tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 11280
ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg tgcgcggaac 11340
ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc 11400
ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt 11460
cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct 11520
ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga 11580
tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag 11640
cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca 11700
actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga 11760
aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag 11820
tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc 11880
ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa 11940
tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt 12000
gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg 12060
gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt 12120
tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg 12180
gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat 12240
ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact 12300
gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa 12360
aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt 12420
ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt 12480
ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg 12540
tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca 12600
gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt 12660
agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga 12720
taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc 12780
gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 12840
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga 12900
caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg 12960
aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt 13020
tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt 13080
acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga 13140
ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac 13200
gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac gcaaaccgcc 13260
tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc ccgactggaa 13320
agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg caccccaggc 13380
tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca 13440
cacaggaaac agctatgacc atgattacgc caagcgcgca attaaccctc actaaaggga 13500
acaaaagctg tcgagatcta gatatcgatg gccatagagt tacg 13544



<210> 7 

<211> 12739

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Rosa 26 
targeting vector pOMPl with TK 



<400> 7 

ctagggataa cagggtaata tagccgcggc aggccctccg agcgtggtgg agccgttctg 60
tgagacagcc gggtacgagt cgtgacgctg gaaggggcaa gcgggtggtg ggcaggaatg 120
cggtccgccc tgcagcaacc ggagggggag ggagaaggga gcggaaaagt ctccaccgga 180
cgcggccatg gctcgggggg gggggggcag cggaggagcg cttccggccg acgtctcgtc 240
gctgattggc ttcttttcct cccgccgtgt gtgaaaacac aaatggcgtg ttttggttgg 300
cgtaaggcgc ctgtcagtta acggcagccg gagtgcgcag ccgccggcag cctcgctctg 360
cccactgggt ggggcgggag gtaggtgggg tgaggcgagc tggacgtgcg ggcgcggtcg 420
gcctctggcg gggcggggga ggggagggag ggtcagcgaa agtagctcgc gcgcgagcgg 480
ccgcccaccc tccccttcct ctgggggagt cgttttaccc gccgccggcc gggcctcgtc 540
gtctgattgg ctctcggggc ccagaaaact ggcccttgcc attggctcgt gttcgtgcaa 600
gttgagtcca tccgccggcc agcgggggcg gcgaggaggc gctcccaggt tccggccctc 660
ccctcggccc cgcgccgcag agtctggccg cgcgcccctg cgcaacgtgg caggaagcgc 720
gcgctggggg cggggacggg cagtagggct gagcggctgc ggggcgggtg caagcacgtt 780
tccgacttga gttgcctcaa gaggggcgtg ctgagccaga cctccatcgc gcactccggg 840
gagtggaggg aaggagcgag ggctcagttg ggctgttttg gaggcaggaa gcacttgctc 900
tcccaaagtc gctctgagtt gttatcagta agggagctgc agtggagtag gcggggagaa 960
ggccgcaccc ttctccggag gggggagggg agtgttgcaa tacctttctg ggagttctct 1020
gctgcctcct ggcttctgag gaccgccctg ggcctgggag aatcccttcc ccctcttccc 1080
tcgtgatctg caactccagt ctttctaggt aaccgatatc cctgcagggg tgacctgcac 1140
gtctagggcg cagtagtcca gggtttcctt gatgatgtca tacttatcct gtcccttttt 1200
tttccacagc tcgcggttga ggacaaactc ttcgcggtct ttccagtact cctgcaggtg 1260
actgactgag tcgactaggg gtaaccataa cttcgtataa tgtatgctat acgaagttat 1320
ttataatcta gaactagtgg atccacgatt cgagggcccc tgcaggtcaa ttctaccggg 1380
taggggaggc gcttttccca aggcagtctg gagcatgcgc tttagcagcc ccgctgggca 1440
cttggcgcta cacaagtggc ctctggcctc gcacacattc cacatccacc ggtaggcgcc 1500
aaccggctcc gttctttggt ggccccttcg cgccaccttc tactcctccc ctagtcagga 1560
agttcccccc cgccccgcag ctcgcgtcgt gcaggacgtg acaaatggaa gtagcacgtc 1620
tcactagtct cgtgcagatg gacagcaccg ctgagcaatg gaagcgggta ggcctttggg 1680
gcagcggcca atagcagctt tgctccttcg ctttctgggc tcagaggctg ggaaggggtg 1740
ggtccggggg cgggctcagg gcgggctcag gggcggggcg ggcgcccgaa ggtcctccgg 1800
aggcccggca ttctgcacgc ttcaaaagcg cacgtctgcc gcgctgttct cctcttcctc 1860
atctccgggc ctttcgacct gcagccaata tgggatcggc cattgaacaa gatggattgc 1920
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 1980 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 2040 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 2100 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 2160 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 2220
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 2280
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 2340
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 2400
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 2460
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 2520 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 2580 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 2640 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgaggggatc 2700 
gatccgctgt aagtctgcag aaattgatga tctattaaac aataaagatg tccactaaaa 2760
tggaagtttt tcctgtcata ctttgttaag aagggtgaga acagagtacc tacattttga 2820
atggaaggat tggagctacg ggggtggggg tggggtggga ttagataaat gcctgctctt 2880
tactgaaggc tctttactat tgctttatga taatgtttca tagttggata tcataattta 2940
aacaagcaaa accaaattaa gggccagctc attcctccca ctcatgatct atagatctat 3000
agatctctcg tgggatcatt gtttttctct tgattcccac tttgtggttc taagtactgt 3060
ggtttccaaa tgtgtcagtt tcatagcctg aagaacgaga tcagcagcct ctgttccaca 3120
tacacttcat tctcagtatt gttttgccaa gttctaattc catcagaagc tgactctaga 3180 
tcctgcagga attcatatga taacttcgta taatgtatgc tatacgaagt tatggcgcgc 3240 
cggtaaccga agttcctata ctttctagag aataggaact tcggaatagg aacttcaagc 3300 
ttaagcgcta gaagatgggc gggagtcttc tgggcaggct taaaggctaa cctggtgtgt 3360 
gggcgttgtc ctgcagggga attgaacagg tgtaaaattg gagggacaag acttcccaca 3420
gattttcggt tttgtcggga agttttttaa taggggcaaa taaggaaaat gggaggatag 3480
gtagtcatct ggggttttat gcagcaaaac tacaggttat tattgcttgt gatccgcctc 3540 
ggagtatttt ccatcgaggt agattaaaga catgctcacc cgagttttat actctcctgc 3600 
ttgagatcct tactacagta tgaaattaca gtgtcgcgag ttagactatg taagcagaat 3660 
tttaatcatt tttaaagagc ccagtacttc atatccattt ctcccgctcc ttctgcagcc 3720 
ttatcaaaag gtattttaga acactcattt tagccccatt ttcatttatt atactggctt 3780 
atccaacccc tagacagagc attggcattt tccctttcct gatcttagaa gtctgatgac 3840 
tcatgaaacc agacagatta gttacataca ccacaaatcg aggctgtagc tggggcctca 3900 
acactgcagt tcttttataa ctccttagta cactttttgt tgatcctttg ccttgatcct 3960
taattttcag tgtctatcac ctctcccgtc agtggtgttc cacatttggg cctattctca 4020
gtccagggag ttttacaaca atagatgtat tgagaatcca acctaaagct taactttcca 4080
ctcccatgaa tgcctctctc ctttttctcc atttataaac tgagctatta accattaatg 4140
gttccaggtg gatgtctcct ccccatatta cctgatgtat cttacatatt gccaggctga 4200
tattttaaga cattaaaagg tatatttcat tattgagcca catggtattg attactgctt 4260
actaaaattt tgtcattgta cacatctgta aaaggtggtt ccttttggaa tgcaaagttc 4320
aggtgtttgt tgtctttcct gacctaaggt cttgtgagct tgtatttttt ctatttaagc 4380
agtgctttct cttggactgg cttgactcat ggcattctac acgttattgc tggtctaaat 4440
gtgattttgc caagcttctt caggacctat aattttgctt gacttgtagc caaacacaag 4500
taaaatgatt aagcaacaaa tgtatttgtg aagcttggtt tttaggttgt tgtgttgtgt 4560 
gtgcttgtgc tctataataa tactatccag gggctggaga ggtggctcgg agttcaagag 4620
cacagactgc tcttccagaa gtcctgagtt caattcccag caaccacatg gtggctcaca 4680
accatctgta atgggatctg atgccctctt ctggtgtgtc tgaagaccac aagtgtattc 4740 
acattaaata aataaatcct ccttcttctt cttttttttt tttttaaaga gaatactgtc 4800 
tccagtagaa tttactgaag taatgaaata ctttgtgttt gttccaatat ggtagccaat 4860 
aatcaaatta ctctttaagc actggaaatg ttaccaagga actaattttt atttgaagtg 4920
taactgtgga cagaggagcc ataactgcag acttgtggga tacagaagac caatgcagac 4980
tttaatgtct tttctcttac actaagcaat aaagaaataa aaattgaact tctagtatcc 5040 
tatttgttta aactgctagc tttacttaac ttttgtgctt catctataca aagctgaaag 5100 
ctaagtctgc agccattact aaacatgaaa gcaagtaatg ataattttgg atttcaaaaa 5160
tgtagggcca gagtttagcc agccagtggt ggtgcttgcc tttatgcctt taatcccagc 5220
actctggagg cagagacagg cagatctctg agtttgagcc cagcctggtc tacacatcaa 5280
gttctatcta ggatagccag gaatacacac agaaaccctg ttggggaggg gggctctgag 5340 
atttcataaa attataattg aagcattccc taatgagcca ctatggatgt ggctaaatcc 5400 
gtctaccttt ctgatgagat ttgggtatta ttttttctgt ctctgctgtt ggttgggtct 5460
tttgacactg tgggctttct ttaaagcctc cttcctgcca tgtggtctct tgtttgctac 5520
taacttccca tggcttaaat ggcatggctt tttgccttct aagggcagct gctgagattt 5580
gcagcctgat ttccagggtg gggttgggaa atctttcaaa cactaaaatt gtcctttaat 5640
ttttttttta aaaaatgggt tatataataa acctcataaa atagttatga ggagtgaggt 5700 
ggactaatat taaatgagtc cctcccctat aaaagagcta ttaaggcttt ttgtcttata 5760 
cttaactttt tttttaaatg tggtatcttt agaaccaagg gtcttagagt tttagtatac 5820 
agaaactgtt gcatcgctta atcagatttt ctagtttcaa atccagagaa tccaaattct 5880 
tcacagccaa agtcaaatta agaatttctg acttttaatg ttaatttgct tactgtgaat 5940
ataaaaatga tagcttttcc tgaggcaggg tctcactatg tatctctgcc tgatctgcaa 6000
caagatatgt agactaaagt tctgcctgct tttgtctcct gaatactaag gttaaaatgt 6060
agtaatactt ttggaacttg caggtcagat tcttttatag gggacacact aagggagctt 6120
gggtgatagt tggtaaaatg tgtttcaagt gatgaaaact tgaattatta tcaccgcaac 6180
ctacttttta aaaaaaaaag ccaggcctgt tagagcatgc ttaagggatc cctaggactt 6240
gctgagcaca caagagtagt tacttggcag gctcctggtg agagcatatt tcaaaaaaca 6300
aggcagacaa ccaagaaact acagttaagg ttacctgtct ttaaaccatc tgcatataca 6360
cagggatatt aaaatattcc aaataatatt tcattcaagt tttcccccat caaattggga 6420
catggatttc tccggtgaat aggcagagtt ggaaactaaa caaatgttgg ttttgtgatt 6480
tgtgaaattg ttttcaagtg atagttaaag cccatgagat acagaacaaa gctgctattt 6540
cgaggtctct tggtttatac tcagaagcac ttctttgggt ttccctgcac tatcctgatc 6600
atgtgctagg cctaccttag gctgattgtt gttcaaataa acttaagttt cctgtcaggt 6660
gatgtcatat gatttcatat atcaaggcaa aacatgttat atatgttaaa catttgtact 6720
taatgtgaaa gttaggtctt tgtgggtttg atttttaatt ttcaaaacct gagctaaata 6780
agtcattttt acatgtctta catttggtgg aattgtataa ttgtggtttg caggcaagac 6840
tctctgacct agtaacccta cctatagagc actttgctgg gtcacaagtc taggagtcaa 6900
gcatttcacc ttgaagttga gacgttttgt tagtgtatac tagtttatat gttggaggac 6960
atgtttatcc agaagatatt caggactatt tttgactggg ctaaggaatt gattctgatt 7020
agcactgtta gtgagcattg agtggccttt aggcttgaat tggagtcact tgtatatctc 7080
aaataatgct ggcctttttt aaaaagccct tgttctttat caccctgttt tctacataat 7140
ttttgttcaa agaaatactt gtttggatct ccttttgaca acaatagcat gttttcaagc 7200
catatttttt ttcctttttt tttttttttt tggtttttcg agacagggtt tctctgtata 7260
gccctggctg tcctggaact cactttgtag accaggctgg cctcgaactc agaaatccgc 7320
ctgcctctgc ctcctgagtg ccgggattaa aggcgtgcac caccacgcct ggctaagttg 7380
gatattttgt tatataacta taaccaatac taactccact gggtggattt ttaattcagt 7440
cagtagtctt aagtggtctt tattggccct tcattaaaat ctactgttca ctctaacaga 7500
ggctgttggt actagtggca cttaagcaac ttcctacgga tatactagca gattaagggt 7560
cagggataga aactagtcta gcgttttgta tacctaccag ctttatacta ccttgttctg 7620
atagaaatat ttcaggacat ctagcacgtg ttaactcgag ctgcaggatt cgagggcccc 7680
ggcaggtcaa ttctaccggg taggggaggc gcttttccca aggcagtctg gagcatgcgc 7740
tttagcagcc ccgctgggca cttggcgcta cacaagtggc ctctggcctc gcacacattc 7800
cacatccacc ggtaggcgcc aaccggctcc gttctttggt ggccccttcg cgccaccttc 7860
tactcctccc ctagtcagga agttcccccc cgccccgcag ctcgcgtcgt gcaggacgtg 7920
acaaatggaa gtagcacgtc tcactagtct cgtgcagatg gacagcaccg ctgagcaatg 7980
gaagcgggta ggcctttggg gcagcggcca atagcagctt tgctccttcg ctttctgggc 8040
tcagaggctg ggaaggggtg ggtccggggg cgggctcagg ggcgggctca ggggcggggc 8100
gggcgcccga aggtcctccg gaggcccggc attctgcacg cttcaaaagc gcacgtctgc 8160
cgcgctgttc tcctcttcct catctccggg cctttcgacc tgcagccaat gcaccgtcct 8220
tgccatcatg gcctcgtacc ccggccatca acacgcgtct gcgttcgacc aggctgcgcg 8280
ttctcgcggc catagcaacc gacgtacggc gttgcgccct cgccggcagc aagaagccac 8340
ggaagtccgc ccggagcaga aaatgcccac gctactgcgg gtttatatag acggtcccca 8400
cgggatgggg aaaaccacca ccacgcaact gctggtggcc ctgggttcgc gcgacgatat 8460
cgtctacgta cccgagccga tgacttactg gcgggtgctg ggggcttccg agacaatcgc 8520
gaacatctac accacacaac accgcctcga ccagggtgag atatcggccg gggacgcggc 8580
ggtggtaatg acaagcgccc agataacaat gggcatgcct tatgccgtga ccgacgccgt 8640
tctggctcct catatcgggg gggaggctgg gagctcacat gccccgcccc cggccctcac 8700
cctcatcttc gaccgccatc ccatcgccgc cctcctgtgc tacccggccg cgcggtacct 8760
tatgggcagc atgacccccc aggccgtgct ggcgttcgtg gccctcatcc cgccgacctt 8820
gcccggcacc aacatcgtgc ttggggccct tccggaggac agacacatcg accgcctggc 8880
caaacgccag cgccccggcg agcggctgga cctggctatg ctggctgcga ttcgccgcgt 8940
ttacgggcta cttgccaata cggtgcggta tctgcagtgc ggcgggtcgt ggcgggagga 9000
ctggggacag ctttcgggga cggccgtgcc gccccagggt gccgagcccc agagcaacgc 9060
gggcccacga ccccatatcg gggacacgtt atttaccctg tttcgggccc ccgagttgct 9120
ggcccccaac ggcgacctgt ataacgtgtt tgcctgggcc ttggacgtct tggccaaacg 9180
cctccgttcc atgcacgtct ttatcctgga ttacgaccaa tcgcccgccg gctgccggga 9240
cgccctgctg caacttacct ccgggatggt ccagacccac gtcaccaccc ccggctccat 9300
accgacgata tgcgacctgg cgcgcacgtt tgcccgggag atgggggagg ctaactgagg 9360
ggatcgatcc gtcctgtaag tctgcagaaa ttgatgatct attaaacaat aaagatgtcc 9420
actaaaatgg aagtttttcc tgtcatactt tgttaagaag ggtgagaaca gagtacctac 9480
attttgaatg gaaggattgg agctacgggg gtgggggtgg ggtgggatta gataaatgcc 9540
tgctctttac tgaaggctct ttactattgc tttatgataa tgtttcatag ttggatatca 9600
taatttaaac aagcaaaacc aaattaaggg ccagctcatt cctcccactc atgatctata 9660
gatctataga tctctcgtgg gatcattgtt tttctcttga ttcccacttt gtggttctaa 9720
gtactgtggt ttccaaatgt gtcagtttca tagcctgaag aacgagatca gcagcctctg 9780
ttccacatac acttcattct cagtattgtt ttgccaagtt ctaattccat cagaagctga 9840
ctctaggccg agctccaatt cgccctatag tgagtcgtat tacaattcac tggccgtcgt 9900
tttacaacgt cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca 9960
tccccctttc gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca 10020
gttgcgcagc ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg 10080
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 10140
cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 10200 
ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 10260 
ttagggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 10320 
gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 10380 
tatctcggtc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 10440
aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat 10500 
ttaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata 10560 
cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga 10620
aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca 10680
ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat 10740
cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag 10800
agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc 10860
gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct 10920
cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca 10980
gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt 11040
ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat 11100
gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt 11160 
gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta 11220 
cttactctag cttcccggca acaattaata gactggatgg aggcggataa agttgcagga 11280 
ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt 11340 
gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc 11400 
gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct 11460 
gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata 11520 
ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt 11580
gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc 11640
gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg 11700
caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact 11760
ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg 11820
tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg 11880
ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac 11940
tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca 12000 
cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga 12060 
gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc 12120
ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct 12180
gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg 12240
agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct 12300
tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc 12360
tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc 12420
gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat 12480
taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt 12540
aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt 12600
atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta tgaccatgat 12660
tacgccaagc gcgcaattaa ccctcactaa agggaacaaa agctgtcgag atctagatat 12720 
cgatggccat agagttacg 12739 



<210> 8 
<211> 15342 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Rosa 26 locus 
targeted with targeting vector 

<400> 8 

aagcttctca cgtagcaacc agagctccag agccagcagc tgctgccgcc ttgtatactc 60 
actcctgtga tccaacacag gagcaacctt ttctttaccc cacccccact tcttaacaca 120 
cttttttttg gggggggggg gggaacaagt gctccatgct ggaaggattg gaactatgct 180 
tttagaaagg aacaatccta aggtcacttt taaattgagg tctttgattt gaaaatcaac 240 
aaataccaaa- ttccaaatat tcgttttaat taaaccagca atgtggatat aagcattaag 300 
ttttagtttt aaaaaggtca attttccaaa cattcagcaa tcatatttaa atttacagct 360 
aggaacaaga gccttgggtc atgtcctacc aaagaacata actcaatatt ctacacatga 420 
caatctgaat aaccttaaag cctctaatcc cataacaggc cacaaatttt ggacagagaa 480
ctaatgatcc tcctgagaaa actggaagaa atccagggaa aagaaattcc tgtgtcctcc 540
aaactcagaa atctctaatt atgtcagtat tctctgcttt agtcctaggt cagattgcac 600
acatctaaaa taacctctta aagttttcct cctagcgacc taaaccatta ttaatatcaa 660
attaaccatc aaaacacttt cctctcaata tgctgcacac aaacctcctc ctggaacctc 720
ctccatctgg atcctcccca atcaaaagta taggtattta acatataagc aaggaagtaa 780
tgtaaacatg accttggtca caaatatgtc atctaaaaac aatttagtca aggtatggag 840
gaaattcgag aacctgaatc tttttaagta ttttgagcac aggaacaatt ggcaaaagga 900
atccaggtat agacaaaacc cagagcccag agctctgggc gaaaaatgag ttgctggtga 960
agacgttaca caagtaacat gagaaagcag aaaatgcagg tcatccacgc acccctgacc 1020
caggccagca gggcgggctg cagcatcagt acacaggaga aagatcctta ttcctaagaa 1080
tgagaaaggc aaaggcgccc gatagaataa attagcatag aaggggcttt cccaggagtt 1140
aaaactttcc ttctgagcga ttacctacta aaaccagggc ttttgcccac taccatttac 1200
ctaggatctt ggcttgcacg gattcatagg ggcatatccc tccccctctt ctttagagtc 1260
gttcttaaaa gatcgctctc cacgccctag gcagggaaaa cgacaaaatc tggctcaatt 1320
ccaggctaga accctacaaa ttcaacaggg atatcgcaag gatactgggg catacgccac 1380
agggagtcca agaatgtgag gtgggggtgg cgaaggtaat gtctttggtg tgggaaaagc 1440
agcagccatc tgagatagga actggaaaac cagaggagag gcgttcagga agattatgga 1500
ggggaggact gggcccccac gagcgaccag agttgtcaca aggccgcaag aacaggggag 1560
gtggggggct cagggacaga aaaaaaagta tgtgtatttt gagagcaggg ttgggaggcc 1620
tctcctgaaa agggtataaa cgtggagtag gcaataccca ggcaaaaagg ggagaccaga 1680
gtagggggag gggaagagtc ctgacccagg gaagacatta aaaaggtagt ggggtcgact 1740
agatgaagga gagcctttct ctctgggcaa gagcggtgca atggtgtgta aaggtagctg 1800
agaagacgaa aagggcaagc atcttcctgc taccaggctg gggaggccca ggcccacgac 1860
cccgaggaga gggaacgcag ggagactgag gtgacccttc tttcccccgg ggcccggtcg 1920
tgtggttcgg tgtctctttt ctgttggacc cttaccttga cccaggcgct gccggggcct 1980
gggcccgggc tgcggcgcac ggcactcccg ggaggcagcg agactcgagt taggcccaac 2040
gcggcgccac ggcgtttcct ggccgggaat ggcccgtacc cgtgaggtgg gggtgggggg 2100
cagaaaaggc ggagcgagcc cgagcgggga gggggagggc caggggcgga gggggccggc 2160
actactgtgt tggcggactg gcgggactag ggctgcgtga gtctctgagc gcaggcgggc 2220
ggcggccgcc cctcccccgg cggcggcagc ggcggcagcg gcggcagctc actcagcccg 2280
ctgcccgagc ggaaacgcca ctgaccgcac ggggattccc agtgccggcg ccaggggcac 2340
gcgggacacg ccccctcccg ccgcgccatt ggcctctccg cccaccgccc cacacttatt 2400
ggccggtgcg ccgccaatca gcggaggctg ccggggccgc ctaaagaaga ggctgtgctt 2460
tggggctccg gctcctcaga gagcctcggc taggtagggg atcgggactc tggcgggagg 2520
gcggcttggt gcgtttgcgg ggatgggcgg ccgcggcagg ccctccgagc gtggtggagc 2580
cgttctgtga gacagccggg tacgagtcgt gacgctggaa ggggcaagcg ggtggtgggc 2640
aggaatgcgg tccgccctgc agcaaccgga gggggaggga gaagggagcg gaaaagtctc 2700
caccggacgc ggccatggct cggggggggg ggggcagcgg aggagcgctt ccggccgacg 2760
tctcgtcgct gattggcttc ttttcctccc gccgtgtgtg aaaacacaaa tggcgtgttt 2820
tggttggcgt aaggcgcctg tcagttaacg gcagccggag tgcgcagccg ccggcagcct 2880
cgctctgccc actgggtggg gcgggaggta ggtggggtga ggcgagctgg acgtgcgggc 2940
gcggtcggcc tctggcgggg cgggggaggg gagggagggt cagcgaaagt agctcgcgcg 3000
cgagcggccg cccaccctcc ccttcctctg ggggagtcgt tttacccgcc gccggccggg 3060
cctcgtcgtc tgattggctc tcggggccca gaaaactggc ccttgccatt ggctcgtgtt 3120
cgtgcaagtt gagtccatcc gccggccagc gggggcggcg aggaggcgct cccaggttcc 3180
ggccctcccc tcggccccgc gccgcagagt ctggccgcgc gcccctgcgc aacgtggcag 3240
gaagcgcgcg ctgggggcgg ggacgggcag tagggctgag cggctgcggg gcgggtgcaa 3300
gcacgtttcc gacttgagtt gcctcaagag gggcgtgctg agccagacct ccatcgcgca 3360
ctccggggag tggagggaag gagcgagggc tcagttgggc tgttttggag gcaggaagca 3420
cttgctctcc caaagtcgct ctgagttgtt atcagtaagg gagctgcagt ggagtaggcg 3480
gggagaaggc cgcacccttc tccggagggg ggaggggagt gttgcaatac ctttctggga 3540
gttctctgct gcctcctggc ttctgaggac cgccctgggc ctgggagaat cccttccccc 3600
tcttccctcg tgatctgcaa ctccagtctt tctaggtaac cgatatccct gcaggggtga 3660
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 3720
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactcct 3780
gcaggtgact gactgagtcg actaggggta accataactt cgtataatgt atgctatacg 3840
aagttattta taatctagaa ctagtggatc cacgattcga gggcccctgc aggtcaattc 3900
taccgggtag gggaggcgct tttcccaagg cagtctggag catgcgcttt agcagccccg 3960
ctgggcactt ggcgctacac aagtggcctc tggcctcgca cacattccac atccaccggt 4020
aggcgccaac cggctccgtt ctttggtggc cccttcgcgc caccttctac tcctccccta 4080
gtcaggaagt tcccccccgc cccgcagctc gcgtcgtgca ggacgtgaca aatggaagta 4140
gcacgtctca ctagtctcgt gcagatggac agcaccgctg agcaatggaa gcgggtaggc 4200
ctttggggca gcggccaata gcagctttgc tccttcgctt tctgggctca gaggctggga 4260
aggggtgggt ccgggggcgg gctcagggcg ggctcagggg cggggcgggc gcccgaaggt 4320
cctccggagg cccggcattc tgcacgcttc aaaagcgcac gtctgccgcg ctgttctcct 4380
cttcctcatc tccgggcctt tcgacctgca gccaatatgg gatcggccat tgaacaagat 4440






ggattgcacg caggttctcc ggccgcttgg 
caacagacaa tcggctgctc tgatgccgcc 
gttctttttg tcaagaccga cctgtccggt 
cggctatcgt ggctggccac gacgggcgtt 
gaagcgggaa gggactggct gctattgggc 
caccttgctc ctgccgagaa agtatccatc
cttgatccgg ctacctgccc attcgaccac 
actcggatgg aagccggtct tgtcgatcag 
gcgccagccg aactgttcgc caggctcaag
gtgacccatg gcgatgcctg cttgccgaat 
ttcatcgact gtggccggct gggtgtggcg 
cgtgatattg ctgaagagct tggcggcgaa 
atcgccgctc ccgattcgca gcgcatcgcc 
ggggatcgat ccgctgtaag tctgcagaaa 
actaaaatgg aagtttttcc tgtcatactt 
attttgaatg gaaggattgg agctacgggg 
tgctctttac tgaaggctct ttactattgc 
taatttaaac aagcaaaacc aaattaaggg 
gatctataga tctctcgtgg gatcattgtt 
gtactgtggt ttccaaatgt gtcagtttca 
ttccacatac acttcattct cagtattgtt 
ctctagatcc tgcaggaatt catatgataa 
ggcgcgccgg taaccgaagt tcctatactt 
ttcaagctta agcgctagaa gatgggcggg 
ggtgtgtggg cgttgtcctg caggggaatt 
tcccacagat tttcggtttt gtcgggaagt
aggataggta gtcatctggg gttttatgca 
ccgcctcgga gtattttcca tcgaggtaga 
ctcctgcttg agatccttac tacagtatga 
gcagaatttt aatcattttt aaagagccca 
tgcagcctta tcaaaaggta ttttagaaca 
ctggcttatc caacccctag acagagcatt 
tgatgactca tgaaaccaga cagattagtt 
ggcctcaaca ctgcagttct tttataactc 
tgatccttaa ttttcagtgt ctatcacctc 
attctcagtc cagggagttt tacaacaata
ctttccactc ccatgaatgc ctctctcctt 
attaatggtt ccaggtggat gtctcctccc 
aggctgatat tttaagacat taaaaggtat 
actgcttact aaaattttgt cattgtacac 
aaagttcagg tgtttgttgt ctttcctgac 
tttaagcagt gctttctctt ggactggctt
tctaaatgtg attttgccaa gcttcttcag
acacaagtaa aatgattaag caacaaatgt
gttgtgtgtg cttgtgctct ataataatac
tcaagagcac agactgctct tccagaagtc 
gctcacaacc atctgtaatg ggatctgatg 
tgtattcaca ttaaataaat aaatcctcct 
tactgtctcc agtagaattt actgaagtaa 
agccaataat caaattactc tttaagcact 
tgaagtgtaa ctgtggacag aggagccata 
tgcagacttt aatgtctttt ctcttacact 
agtatcctat ttgtttaaac tgctagcttt 
ctgaaagcta agtctgcagc cattactaaa 
tcaaaaatgt agggccagag tttagccagc 
tcccagcact ctggaggcag agacaggcag 
acatcaagtt ctatctagga tagccaggaa 
ctctgagatt tcataaaatt ataattgaag 
taaatccgtc tacctttctg atgagatttg 
tgggtctttt gacactgtgg gctttcttta 
ttgctactaa cttcccatgg cttaaatggc 
gagatttgca gcctgatttc cagggtgggg 
ctttaatttt ttttttaaaa aatgggttat 
gtgaggtgga ctaatattaa atgagtccct 
tcttatactt aacttttttt ttaaatgtgg 
agtatacaga aactgttgca tcgcttaatc 
aaattcttca ■ cagccaaagt caaattaaga 



gtggagaggc tattcggcta tgactgggca 4500 
gtgttccggc tgtcagcgca ggggcgcccg 4560 
gccctgaatg aactgcagga cgaggcagcg 4 620 
ccttgcgcag ctgtgctcga cgttgtcact 4 680 
gaagtgccgg ggcaggatct cctgtcatct 4740 
atggctgatg caatgcggcg gctgcatacg 4800 
caagcgaaac atcgcat.cga gcgagcacgf 48 60 
gatgatctgg acgaagagca tcaggggctc 4 920 
gcgcgcatgc ccgacggcga ggatctcgtc 4980 
atcatggtgg aaaatggccg cttttctgga 5040 
gaccgctatc aggacatagc gttggctacc 5100 
tgggctgacc gcttcctcgt gctttacggt 5160 
ttctatcgcc ttcttgacga .gttcttctga 5220 
ttgatgatct attaaacaat aaagatgtcc 5280 
tgttaagaag ggtgagaaca gagtacctac 5340 
gtgggggtgg ggtgggatta gataaatgcc 5400 
tttatgataa tgtttcatag ttggatatca 54 60 
ccagctcatt cctcccactc atgatctata 5520 
tttctcttga ttcccacttt gtggttctaa • 5580 
tagcctgaag aacgagatca gcagcctctg 5640 
ttgccaagtt ctaattccat cagaagctga 5700 
cttcgtataa .tgtatgctat acgaagttat 5760 
tctagagaat aggaacttcg gaataggaac 5820 
agtcttctgg gcaggcttaa aggctaacct 5880 
gaacaggtgt aaaattggag ggacaagact 5940 
tttttaatag gggcaaataa ggaaaatggg 6000 
gcaaaactac aggttattat tgcttgtgat 6060 
ttaaagacat gctcacccga gttttatact 6120 
aattacagtg tcgcgagtta gactatgtaa 6180 
gtacttcata tccatttctc ccgctccttc- 6240 
ctcattttag ccccattttc atttattata- 6300 
ggcattttcc ctttcctgat cttagaagtc 6360 
acatacacca caaatcgagg ctgtagctgg' 6420 
cttagtacac' tttttgttga tcctttgcct' 6480 
tcccgtcagt ggtgttccac atttgggcct 6540. 
gatgtattga gaatccaacc taaagcttaa 6600 
tttctccatt tataaactga gctattaacc 6660 
.catattacct gatgtatctt acatattgcc 6720 
atttcattat tgagccacat ggtattgatt 6780 
atctgtaaaa ggtggttcct tttggaatgc 6840 
ctaaggtctt gtgagcttgt' attttttcta 6900 
gactcatggc attctacacg. ttattgctgg 6960 
gacctataat tttgcttgac ttgtagccaa 7020 
atttgtgaag cttggt.tttt- aggttgttgt, .,71380 
tatccagggg ctggagaggt ggctcggagt 7140 
ctgagttcaa ttcccagcaa ccacatggtg 7200 
ccctcttctg gtgtgtctga agaccacaag 7260 
tcttcttctt tttttttttt ttaaagagaa 7320 
tgaaatactt tgtgtttgtt ccaatatggt 7380 
ggaaatgtta ccaaggaact aatttttatt 7440 
actgcagact tgtgggatac agaagaccaa 7500 
aagcaataaa gaaataaaaa ttgaacttct 7560 
acttaacttt tgtgcttcat ctatacaaag 7 620 
catgaaagca agtaatgata attttggatt 7680 
cagtggtggt gcttgccttt atgcctttaa 7740 
atctctgagt ttgagcccag cctggtctac 7800 
tacacacaga aaccctgttg gggagggggg 7860 
cattccctaa tgagccacta tggatgtggc 7 920. 
ggtattattt tttctgtctc tgctgttggt 7980 
aagcctcctt cctgccatgt ggtctcttgt 8040 
atggcttttt- gccttctaag ggcagctgct 8100 
ttgggaaatc tttcaaacac taaaattgtc 8160 
ataataaacc tcataaaata' gttatgagga 8220 
cccctataaa agagctatta aggctttttg 8280 
tatctttaga accaagggtc ttagagtttt 8340 
agattttcta gtttcaaatc cagagaatcc 8400 
atttctgact tttaatgtta atttgcttac 8460 
tgtgaatata aaaatgatag cttttcctga ggcagggtct cactatgtat ctctgcctga 8520
tctgcaacaa gatatgtaga ctaaagttct gcctgctttt gtctcctgaa tactaaggtt 8580 
aaaatgtagt aatacttttg gaacttgcag gtcagattct tttatagggg acacactaag 8640 
ggagcttggg tgatagttgg taaaatgtgt ttcaagtgat gaaaacttga attattatca 8700
ccgcaaccta ctttttaaaa aaaaaagcca ggcctgttag agcatgctta agggatccct 8760
aggacttgct gagcacacaa gagtagttac ttggcaggct cctggtgaga gcatatttca 8820 
aaaaacaagg cagacaacca agaaactaca gttaaggtta cctgtcttta aaccatctgc 8880
atatacacag ggatattaaa atattccaaa taatatttca ttcaagtttt cccccatcaa 8940 
attgggacat ggatttctcc ggtgaatagg cagagttgga aactaaacaa atgttggttt 9000 
tgtgatttgt gaaattgttt tcaagtgata gttaaagccc atgagataca gaacaaagct 9060 
gctatttcga ggtctcttgg tttatactca gaagcacttc tttgggtttc cctgcactat 9120 
cctgatcatg tgctaggcct accttaggct gattgttgtt caaataaact taagtttcct 9180
gtcaggtgat gtcatatgat ttcatatatc aaggcaaaac atgttatata tgttaaacat 9240
ttgtacttaa tgtgaaagtt aggtctttgt gggtttgatt tttaattttc aaaacctgag 9300 
ctaaataagt catttttaca tgtcttacat ttggtggaat tgtataattg tggtttgcag 9360
gcaagactct ctgacctagt aaccctacct atagagcact ttgctgggtc acaagtctag 9420 
gagtcaagca tttcaccttg aagttgagac gttttgttag tgtatactag tttatatgtt 9480 
ggaggacatg tttatccaga agatattcag gactattttt gactgggcta aggaattgat 9540 
tctgattagc actgttagtg agcattgagt ggcctttagg cttgaattgg agtcacttgt 9600
atatctcaaa taatgctggc cttttttaaa aagcccttgt tctttatcac cctgttttct 9660 
acataatttt tgttcaaaga aatacttgtt tggatctcct tttgacaaca atagcatgtt 9720 
ttcaagccat attttttttc cttttttttt ttttttttgg tttttcgaga cagggtttct 9780 
ctgtatagcc ctggctgtcc tggaactcac tttgtagacc aggctggcct cgaactcaga 9840
aatccgcctg cctctgcctc ctgagtgccg ggattaaagg cgtgcaccac cacgcctggc 9900 
taagttggat attttgttat ataactataa ccaatactaa ctccactggg tggattttta 9960 
attcagtcag tagtcttaag tggtctttat tggcccttca ttaaaatcta ctgttcactc 10020 
taacagaggc tgttggtact agtggcactt aagcaacttc ctacggatat actagcagat 10080 
taagggtcag ggatagaaac tagtctagcg ttttgtatac ctaccagctt tatactacct 10140 

tgttctgata gaaatatttc aggacatcta gagtgtacta taaggttgat ggtaagctta 10200
taaggaactt gaaagtggag taactactcc atttctctga ggggagaatt aaaatttttg 10260
accaagtgtt gttgagccac tgagaatggt ctcagaacat aacttcttaa ggaaccttcc 10320
cagattgccc tcaacactgc accacatttg gtcctgcttg aacattgcca tggctcttaa 10380
agtcttaatt aagaatatta attgtgtaat tattgttttt cctcctttag atcattcctt 10440
gaggacagga cagtgcttgt ttaaggctat atttctgctg tctgagcagc aacaggtctt 10500 
cgagatcaac atgatgttca taatcccaag atgttgccat ttatgttctc agaagcaagc 10560 
agaggcatga tggtcagtga cagtaatgtc actgtgttaa atgttgctat gcagtttgga 10620 
tttttctaat gtagtgtagg tagaacatat gtgttctgta tgaattaaac tcttaagtta 10680 
caccttgtat aatccatgca atgtgttatg caattaccat tttaagtatt gtagctttct 10740 
ttgtatgtga ggataaaggt gtttgtcata aaatgttttg aacatttccc caaagttcca 10800 
aattataaaa ccacaacgtt agaacttatt tatgaacaat ggttgtagtt tcatgctttt 10860
aaaatgctta attattcaat taacaccgtt tgtgttataa tatatataaa actgacatgt 10920
agaagtgttt gtccagaaca tttcttaaat gtatactgtc tttagagagt ttaatatagc 10980
atgtcttttg caacatacta acttttgtgt tggtgcgagc aatattgtgt agtcattttg 11040
aaaggagtca tttcaatgag tgtcagattg ttttgaatgt tattgaacat tttaaatgca 11100
gacttgttcg tgttttagaa agcaaaactg tcagaagctt tgaactagaa attaaaaagc 11160
tgaagtattt cagaagggaa ataagctact tgctgtatta gttgaaggaa agtgtaatag 11220 
cttagaaaat ttaaaaccat atagttgtca ttgctgaata tctggcagat gaaaagaaat 11280 
actcagtggt tcttttgagc aatataacag cttgttatat taaaaatttt ccccacagat 11340
ataaactcta atctataact cataaatgtt acaaatggat gaagcttaca aatgtggctt 11400 
gacttgtcac tgtgcttgtt ttagttatgt gaaagtttgg caataaacct atgtcctaaa 11460
tagtcaaact gtggaatgac tttttaatct attggtttgt ctagaacagt tatgttgcca 11520 
tttgccctaa tgatgaaaga aaaagtgggg agtgccttgg cactgttcat ttgtggtgtg 11580
aaccaaagag gggggcatgc acttacactt caaacatcct tttgaaagac tgacaagttt 11640
gggtcttcac agttggaatt gggcatccct tttgtcaggg agggagggag ggagggaggc 11700 
tggcttgtta tgctgacaag tgtgattaaa ttcaaacttt gaggtaagtt ggaggaactt 11760
gtacattgtt aggagtgtga caatttggac tcttaatgat ttggtcatac aaaatgaacc 11820 
tagaccaact tctggaagat gtatataata actccatgtt acattgattt cacctgacta 11880 
atacttatcc cttatcaatt aaatacagaa gatgccagcc atctgggcct tttaacccag 11940 
aaatttagtt tcaaactcct aggttagtgt tctcactgag ctacatcctg atctagtcct 12000 
gaaaatagga ccaccatcac ccccaaaaaa atctcaaata agatttatgc tagtgtttca 12060
aaattttagg aataggtaag attagaaagt tttaaatttt gagaaatggc ttctctagaa 12120 
agatgtacat agtgaacact gaatggctcc taaagagcct agaaaactgg tactgagcac 12180 
acaggactga gaggtctttc ttgaaaagca tgtattgctt tacgtgggtc acagaaggca 12240
ggcaggaaga acttgggctg aaactggtgt cttaagtggc taacatcttc acaactgatg 12300 
agcaagaact ttatcctgat gcaaaaacca tccaaacaaa ctaagtgaaa ggtggcaatg 12360 
gatcccaggc tgctctagag gaggacttga cttctcatcc catcacccac accagatagc 12420 
tcatagactg ccaattaaca ccagcttcta gcctccacag gcacctgcac tggtacacat 12480 
aatttcacac aaacacagta agaagccttc cacctggcat ggtattgctt atctttagtt 12540
cccaacactt gggaggcaga ggccagccag ggctatgtga caaaaacctt gtctagagga 12600 
gaaacttcat agcttatttc ctattcacgt aaccaggtta gcaaaattta ccagccagag 12660 
atgaagctaa cagtgtccac tatatttgta gtgttttaag tcaatttttt aaatatactt 12720 
aatagaatta aagctatggt gaaccaagta caaacctggt gtattaactt gagaacttag 12780 
cataaaaagt agttcatttg ttcagtaaat attaaatgct tactggcaaa gattatgtca 12840
ggaacttggt aaatggtgat gaaacaatca tagttgtaca tcttggttct gtgatcacct 12900 
tggtttgagg taaaagtggt tcctttgatc aaggatggaa ttttaagttt atattcaatc 12960 
aataatgtat tattttgtga ttgcaaaatt gcctatctag ggtataaaac ctttaaaaat 13020 
ttcataatac cagttcattc tccagttact aattccaaaa agccactgac tatggtgcca 13080 
atgtggattc tgttctcaaa ggaaggattg tctgtgccct ttattctaat agaaacatca 13140
cactgaaaat ctaagctgaa agaagccaga ctttcctaaa taaataactt tccataaagc 13200
tcaaacaagg attactttta ggaggcactg ttaaggaact gataagtaat gaggttactt 13260
atataatgat agtcccacaa gactatctga ggaaaaatca gtacaactcg aaaacagaac 13320 
aaccagctag gcaggaataa cagggctccc aagtcaggag gtctatccaa cacccttttc 13380 
tgttgagggc cccagaccta catattgtat acaaacaggg aggtgggtga ttttaactct 13440
cctgaggtac cttggtaaat ctttgtcctg agtaagcagt acagtgtaca gtttacattt 13500 
tcatttaaag atacattagc tccctctacc ccctaagact gacaggcact ttgggggtgg 13560
ggagggcttt ggaaaataac gcttccatac actaaaagag aaatttcttt aattaggctt 13620 
gttggttcca tacatctact ggtgtttcta ctacttagta atattataat agtcacacaa 13680 
gcatctttgc tctgtttagg ttgtatattt attttaaggc agatgataaa actgtagatc 13740 
ttaagggatg cttctgcttc tgagatgata caaagaattt agaccataaa acagtaggtt 13800
gcacaagcaa tagaatatgg cctaaagtgt tctgacactt agaagccaag cagtgtaggc 13860 
ttcttaagaa ataccattac aatcaccttg ctagaaatca agcattctgg agtggtcaag 13920 
cagtgtaacc tgtactgtaa gttacttttc tgctattttt ctcccaaagc aagttcttta 13980 
tgctgatatt tccagtgtta ggaactacaa atattaataa gttgtcttca ctcttttctt 14040
taccaaggag ggtctcttcc ttcatcttga tctgaaggat gaacaaaggc ttgagcagtg 14100
cgctttagaa gataaactgc agcatgaagg cccccgatgt tcacccagac tacatggacc 14160 
tttcgccaca catgtcccat tccagataag gcctggcaca cacaaaaaac ataagtcatt 14220 
aggctaccag tctgattcta aaacaaccta aaatcttccc acttaaatgc tatgggtggt 14280 
gggttggaaa gttgactcag aaaatcactt gctgttttta gagaggatct gggttcagtt 14340 
tctgatacat tgtggcttac aactataact ccagttctag ggggtccatc caacatcctc 14400 
ttctgttgag ggcaccaaat aaatgtattg tgtacaaaca gggaggtgag tgatttaact 14460
ctcgtgtata gtaccttggt aaaacatttc ttgtcctgag taagcagtac agctctgcct 14520 
gtccctggtc tacagacacg gctcatttcc cgaaggcaag ctggatagag attccaattt 14580
ctcttcttgg atcccatcct ataaaagaag gtcaagttta atctattgca aaaggtaaat 14640
aggtagtttc ttacatgaga caagaacaaa tcttaggtgt gaagcagtca tcttttacag 14700 
gccagagcct ctattctatg ccaatgaagg aaactgttag tccagtgtta tagagttagt 14760
ccagtgtata gttttctatc agaacacttt ttttttaaac aactgcaact tagcttattg 14820 
aagacaaacc acgagtagaa atctgtccaa gaagcaagtg cttctcagcc tacaatgtgg 14880
aataggacca tgtaatggta cagtgagtga aatgaattat ggcatgtttt tctgactgag 14940 
aagacagtac aataaaaggt aaactcatgg tatttattta aaaagaatcc aatttctacc 15000 
tttttccaaa tggcatatct gttacaataa tatccacaga agcagttctc agtgggaggt 15060 
tgcagatatc ccactgaaca gcatcaatgg gcaaacccca ggttgttttt ctgtggagac 15120
aaaggtaaga tatttcaata tattttccca agctaatgag atggctcagc aaataatggt 15180 
actggccatt aagtctcatg acctgagctt gatcctcagg gaccatgtgg tacaaggaga 15240 
gacctaaatc cttcagttgg acttcaatct tctaccctca tgtccacaca caaataaata 15300 
caataaaaaa cattctgcag tcgaatttct aaaagggcga at 15342 



<210> 9 
<211> 1823 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGK-neo-pA 
<400> 9 

ggtcaattct accgggtagg ggaggcgctt ttcccaaggc agtctggagc atgcgcttta 60 
gcagccccgc tgggcacttg gcgctacaca agtggcctct ggcctcgcac acattccaca 120 
tccaccggta ggcgccaacc ggctccgttc tttggtggcc ccttcgcgcc accttctact 180
cctcccctag tcaggaagtt cccccccgcc ccgcagctcg cgtcgtgcag gacgtgacaa 240 
atggaagtag cacgtctcac tagtctcgtg cagatggaca gcaccgctga gcaatggaag 300 
cgggtaggcc tttggggcag cggccaatag cagctttgct ccttcgcttt ctgggctcag 360 
aggctgggaa ggggtgggtc cgggggcggg ctcagggcgg gctcaggggc ggggcgggcg 420
cccgaaggtc ctccggaggc ccggcattct gcacgcttca aaagcgcacg tctgccgcgc 480 
tgttctcctc ttcctcatct ccgggccttt cgacctgcag ccaatatggg atcggccatt 540
gaacaagatg gattgcacgc aggttctccg gccgcttggg tggagaggct attcggctat 600
gactgggcac aacagacaat cggctgctct gatgccgccg tgttccggct gtcagcgcag 660 
gggcgcccgg ttctttttgt caagaccgac ctgtccggtg ccctgaatga actgcaggac 720 
gaggcagcgc ggctatcgtg gctggccacg acgggcgttc cttgcgcagc tgtgctcgac 780 

gttgtcactg aagcgggaag ggactggctg ctattgggcg aagtgccggg gcaggatctc 840
ctgtcatctc accttgctcc tgccgagaaa gtatccatca tggctgatgc aatgcggcgg 900
ctgcatacgc ttgatccggc tacctgccca ttcgaccacc aagcgaaaca tcgcatcgag 960
cgagcacgta ctcggatgga agccggtctt gtcgatcagg atgatctgga cgaagagcat 1020 
caggggctcg cgccagccga actgttcgcc aggctcaagg cgcgcatgcc cgacggcgag 1080 
gatctcgtcg tgacccatgg cgatgcctgc ttgccgaata tcatggtgga aaatggccgc 1140
ttttctggat tcatcgactg tggccggctg ggtgtggcgg accgctatca ggacatagcg 1200
ttggctaccc gtgatattgc tgaagagctt ggcggcgaat gggctgaccg cttcctcgtg 1260
ctttacggta tcgccgctcc cgattcgcag cgcatcgcct tctatcgcct tcttgacgag 1320
ttcttctgag gggatcgatc cgctgtaagt ctgcagaaat tgatgatcta ttaaacaata 1380
aagatgtcca ctaaaatgga agtttttcct gtcatacttt gttaagaagg gtgagaacag 1440
agtacctaca ttttgaatgg aaggattgga gctacggggg tgggggtggg gtgggattag 1500 
ataaatgcct gctctttact gaaggctctt tactattgct ttatgataat gtttcatagt 1560 
tggatatcat aatttaaaca agcaaaacca aattaagggc cagctcattc ctcccactca 1620 
tgatctatag atctatagat ctctcgtggg atcattgttt ttctcttgat tcccactttg 1680 
tggttctaag tactgtggtt tccaaatgtg tcagtttcat agcctgaaga acgagatcag 1740 
cagcctctgt tccacataca cttcattctc agtattgttt tgccaagttc taattccatc 1800
agaagctgac tctagatcct gca 1823 

<210> 10 

<211> 1867 

<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: CAGGS-Intron 

<400> 10
tcgacattga ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag 60
cccatatatg gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc 120 
caacgacccc cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg 180 
gactttccat tgacgtcaat gggtggacta tttacggtaa actgcccact tggcagtaca 240
tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc 300
ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt 360 
attagtcatc gctattacca tgggtcgagg tgagccccac gttctgcttc actctcccca 420 
tctccccccc ctccccaccc ccaattttgt atttatttat tttttaatta ttttgtgcag 480
cgatgggggc gggggggggg ggggcgcgcg ccaggcgggg cggggcgggg cgaggggcgg 540
ggcggggcga ggcggagagg tgcggcggca gccaatcaga gcggcgcgct ccgaaagttt 600
ccttttatgg cgaggcggcg gcggcggcgg ccctataaaa agcgaagcgc gcggcgggcg 660
ggagtcgctg cgttgccttc gccccgtgcc ccgctccgcg ccgcctcgcg ccgcccgccc 720 
cggctctgac tgaccgcgtt actcccacag gtgagcgggc gggacggccc ttctcctccg 780
ggctgtaatt agcgcttggt ttaatgacgg ctcgtttctt ttctgtggct gcgtgaaagc 840
cttaaagggc tccgggaggg ccctttgtgc gggggggagc ggctcggggg gtgcgtgcgt 900
gtgtgtgtgc gtggggagcg ccgcgtgcgg cccgcgctgc ccggcggctg tgagcgctgc 960
gggcgcggcg cggggctttg tgcgctccgc gtgtgcgcga ggggagcgcg gccgggggcg 1020
gtgccccgcg gtgcgggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt 1080
gggggggtga gcagggggtg tgggcgcggc ggtcgggctg taaccccccc ctgcaccccc 1140
ctccccgagt tgctgagcac ggcccggctt cgggtgcggg gctccgtgcg gggcgtggcg 1200
cggggctcgc cgtgccgggc ggggggtggc ggcaggtggg ggtgccgggc ggggcggggc 1260
cgcctcgggc cggggagggc tcgggggagg ggcgcggcgg ccccggagcg ccggcggctg 1320 
tcgaggcgcg gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg 1380 
acttcctttg tcccaaatct ggcggagccg aaatctggga ggcgccgccg caccccctct 1440 
agcgggcgcg ggcgaagcgg tgcggcgccg gcaggaagga aatgggcggg gagggccttc 1500 
gtgcgtcgcc gcgccgccgt ccccttctcc atctccagcc tcggggctgc cgcaggggga 1560
cggctgcctt cgggggggac ggggcagggc ggggttcggc ttctggcgtg tgaccggcgg 1620 
ctctagtaag cgttggggtg agtactccct ctcaaaagcg ggcatgactt ctgcgctaag 1680 
attgtcagtt tccaaaaacg aggaggattt gatattcacc tggcccgcgg tgatgccttt 1740 
gagggtggcc gcgtccatct ggtcagaaaa gacaatcttt ttgttgtcaa gcttgaggtg 1800
tggcaggctt gagatctggc catacacttg agtgacattg acatccactt tgcctttctc 1860
tccacag 1867
<210> 11
<211> 703 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ZsGreen ORF 
<400> 11 

cgccaccatg gcccagtcca agcacggcct gaccaaggag atgaccatga agtaccgcat 60 
ggagggctgc gtggacggcc acaagttcgt gatcaccggc gagggcatcg gctacccctt 120 
caagggcaag caggccatca acctgtgcgt ggtggagggc ggccccttgc ccttcgccga 180
ggacatcttg tccgccgcct tcatgtacgg caaccgcgtg ttcaccgagt acccccagga 240
catcgtcgac tacttcaaga actcctgccc cgccggctac acctgggacc gctccttcct 300 
gttcgaggac ggcgccgtgt gcatctgcaa cgccgacatc accgtgagcg tggaggagaa 360
ctgcatgtac cacgagtcca agttctacgg cgtgaacttc cccgccgacg gccccgtgat 420 
gaagaagatg accgacaact gggagccctc ctgcgagaag atcatccccg tgcccaagca 480 
gggcatcttg aagggcgacg tgagcatgta cctgctgctg aaggacggtg gccgcttgcg 540 
ctgccagttc gacaccgtgt acaaggccaa gtccgtgccc cgcaagatgc ccgactggca 600
cttcatccag cacaagctga cccgcgagga ccgcagcgac gccaagaacc agaagtggca 660 
cctgaccgag cacgccatcg cctccggctc cgccttgccc tga 703



<210> 12 
<211> 36 
<212> DNA

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: pA 
<400> 12

aaataaagca atagcatcac aaatttcaca aataaa 



<210> 13

<211> 7078

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector
consisting of a FRT flanked neomycin resistance
gene and a PGK-TK-pA expression cassette

<400> 13 

taaagggaac aaaagctgtc gagatctaga tatcgatggc catagagtta cgctagggat 60
aacagggtaa tatagccgcg gtcgcctagg taaccgaagt tcctatactt tctagagaat 120
aggaacttcg gaataggaac ttcttaggtc aattctaccg ggtaggggag gcgcttttcc 180
caaggcagtc tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg 240
gcctctggcc tcgcacacat tccacatcca ccggtaggcg ccaaccggct ccgttctttg 300
gtggcccctt cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc 360
agctcgcgtc gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga 420
tggacagcac cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc 480
tttgctcctt cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca 540
ggggcgggct caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca 600
cgcttcaaaa gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga 660
cctgcagcca atatgggatc ggccattgaa caagatggat tgcacgcagg ttctccggcc 720
gcttgggtgg agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat 780
gccgccgtgt tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg 840
tccggtgccc tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg 900
ggcgttcctt gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta 960
ttgggcgaag tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta 1020
tccatcatgg ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc 1080
gaccaccaag cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc 1140
gatcaggatg atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg 1200
ctcaaggcgc gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg 1260
ccgaatatca tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt 1320
gtggcggacc gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc 1380
ggcgaatggg ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc 1440
atcgccttct atcgccttct tgacgagttc ttctgagggg atcgatccgc tgtaagtctg 1500
cagaaattga tgatctatta aacaataaag atgtccacta aaatggaagt ttttcctgtc 1560
atactttgtt aagaagggtg agaacagagt acctacattt tgaatggaag gattggagct 1620
acgggggtgg gggtggggtg ggattagata aatgcctgct ctttactgaa ggctctttac 1680
tattgcttta tgataatgtt tcatagttgg atatcataat ttaaacaagc aaaaccaaat 1740
taagggccag ctcattcctc ccactcatga tctatagatc tatagatctc tcgtgggatc 1800
attgtttttc tcttgattcc cactttgtgg ttctaagtac tgtggtttcc aaatgtgtca 1860
gtttcatagc ctgaagaacg agatcagcag cctctgttcc acatacactt cattctcagt 1920
attgttttgc caagttctaa ttccatcaga agctgactct agatcccgcg ccgaagttcc 1980
tatactttct agagaatagg aacttcggaa taggaacttc aagcttaagc gctagcacgt 2040
gttaactcga gctgcaggat tcgagggccc cggcaggtca attctaccgg gtaggggagg 2100
cgcttttccc aaggcagtct ggagcatgcg ctttagcagc cccgctgggc acttggcgct 2160
acacaagtgg cctctggcct cgcacacatt ccacatccac cggtaggcgc caaccggctc 2220
cgttctttgg tggccccttc gcgccacctt ctactcctcc cctagtcagg aagttccccc 2280
ccgccccgca gctcgcgtcg tgcaggacgt gacaaatgga agtagcacgt ctcactagtc 2340
tcgtgcagat ggacagcacc gctgagcaat ggaagcgggt aggcctttgg ggcagcggcc 2400
aatagcagct ttgctccttc gctttctggg ctcagaggct gggaaggggt gggtccgggg 2460
gcgggctcag gggcgggctc aggggcgggg cgggcgcccg aaggtcctcc ggaggcccgg 2520
cattctgcac gcttcaaaag cgcacgtctg ccgcgctgtt ctcctcttcc tcatctccgg 2580
gcctttcgac ctgcagccaa tgcaccgtcc ttgccatcat ggcctcgtac cccggccatc 2640
aacacgcgtc tgcgttcgac caggctgcgc gttctcgcgg ccatagcaac cgacgtacgg 2700
cgttgcgccc tcgccggcag caagaagcca cggaagtccg cccggagcag aaaatgccca 2760
cgctactgcg ggtttatata gacggtcccc acgggatggg gaaaaccacc accacgcaac 2820
tgctggtggc cctgggttcg cgcgacgata tcgtctacgt acccgagccg atgacttact 2880
ggcgggtgct gggggcttcc gagacaatcg cgaacatcta caccacacaa caccgcctcg 2940
accagggtga gatatcggcc ggggacgcgg cggtggtaat gacaagcgcc cagataacaa 3000
tgggcatgcc ttatgccgtg accgacgccg ttctggctcc tcatatcggg ggggaggctg 3060
ggagctcaca tgccccgccc ccggccctca ccctcatctt cgaccgccat cccatcgccg 3120
ccctcctgtg ctacccggcc gcgcggtacc ttatgggcag catgaccccc caggccgtgc 3180
tggcgttcgt ggccctcatc ccgccgacct tgcccggcac caacatcgtg cttggggccc 3240
ttccggagga cagacacatc gaccgcctgg ccaaacgcca gcgccccggc gagcggctgg 3300
acctggctat gctggctgcg attcgccgcg tttacgggct acttgccaat acggtgcggt 3360
atctgcagtg cggcgggtcg tggcgggagg actggggaca gctttcgggg acggccgtgc 3420
cgccccaggg tgccgagccc cagagcaacg cgggcccacg accccatatc ggggacacgt 3480
tatttaccct gtttcgggcc cccgagttgc tggcccccaa cggcgacctg tataacgtgt 3540
ttgcctgggc cttggacgtc ttggccaaac gcctccgttc catgcacgtc tttatcctgg 3600
attacgacca atcgcccgcc ggctgccggg acgccctgct gcaacttacc tccgggatgg 3660
tccagaccca cgtcaccacc cccggctcca taccgacgat atgcgacctg gcgcgcacgt 3720
ttgcccggga gatgggggag gctaactgag gggatcgatc cgtcctgtaa gtctgcagaa 3780
attgatgatc tattaaacaa taaagatgtc cactaaaatg gaagtttttc ctgtcatact 3840
ttgttaagaa gggtgagaac agagtaccta cattttgaat ggaaggattg gagctacggg 3900
ggtgggggtg gggtgggatt agataaatgc ctgctcttta ctgaaggctc tttactattg 3960
ctttatgata atgtttcata gttggatatc ataatttaaa caagcaaaac caaattaagg 4020
gccagctcat tcctcccact catgatctat agatctatag atctctcgtg ggatcattgt 4080
ttttctcttg attcccactt tgtggttcta agtactgtgg tttccaaatg tgtcagtttc 4140
atagcctgaa gaacgagatc agcagcctct gttccacata cacttcattc tcagtattgt 4200
tttgccaagt tctaattcca tcagaagctg actctaggcc gagctccaat tcgccctata 4260
gtgagtcgta ttacaattca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg 4320
gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg 4380
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg 4440
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 4500
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 4560
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 4620
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 4680
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 4740
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 4800
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 4860
cgaattttaa caaaatatta acgcttacaa tttaggtggc acttttcggg gaaatgtgcg 4920
cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4980
ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 5040
ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 5100
aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 5160
actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 5220
gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 5280
agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 5340 
cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 5400 
catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5460
aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5520 
gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5580 
aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5640
agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5700
ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5760
actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5820 
aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5880 
gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5940
atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 6000
tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 6060 
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 6120 
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 6180
agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa 6240
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 6300 
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 6360 
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6420 
cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa 6480
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6540
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6600
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6660 
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6720 
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6780 
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6840 
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6900
ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6960
ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 7020
atttcacaca ggaaacagct atgaccatga ttacgccaag cgcgcaatta accctcac 7078 